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A NEW METHOD TO DIRECTLY MEASURE THE JEANS SCALE OF THE INTERGALACTIC MEDIUM 

USING CLOSE QUASAR PAIRS 
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ABSTRACT 

Although the baryons in the intergalactic medium (IGM) trace dark matter fluctuations on Mpc 
scales, on smaller scales ~ 100 kpc, fluctuations are suppressed because the finite temperature gas is 
pressure supported against gravity, analogous to the classical Jeans argument. This Jeans filtering 
scale, which quantifies the small-scale structure of the IGM, has fundamental cosmological impli- 
cations. First, it provides a thermal record of heat injected by ultraviolet photons during cosmic 
reionization events, and thus constrains the thermal and reionization history of the Universe. Second, 
the Jeans scale determines the dumpiness of the IGM, a critical ingredient in models of cosmic reion- 
ization. Third, it sets the minimum mass scale for gravitational collapse from the IGM, and hence 
plays a pivotal role in galaxy formation. Unfortunately, it is extremely challenging to measure the 
Jeans scale via the standard technique of analyzing purely longitudinal Lya forest spectra, because 
the thermal Doppler broadening of absorption lines along the line-of-sight, is highly degenerate with 
Jeans smoothing. In this work we show that the Jeans filtering scale can be directly measured by 
characterizing the coherence of correlated Lya forest absorption in close quasar pairs, with separations 
small enough ~ 100 kpc to resolve it. We present a novel technique for this purpose, based on the 
probability distribution function (PDF) of phase angle differences of homologous longitudinal Fourier 
modes in close quasar pair spectra. A Bayesian formalism is introduced based on the phase angle PDF, 
and MCMC techniques are used to characterize the precision of a hypothetical Jeans scale measure- 
ment, and explore degeneracies with other thermal parameters governing the IGM. A semi-analytical 
model of the Lya forest is used to generate a large grid (500) of thermal models from a dark matter 
only simulation. Our full parameter study indicates that a realistic sample of only 20 close quasar 
pair spectra can pinpoint the Jeans scale to ~ 5% precision, independent of the amplitude Tq and 
slope 7 of the temperature-density relation of the IGM T — To{p/ p)'^^^ . This exquisite sensitivity 
arises because even long-wavelength ID Fourier modes ^ 10 Mpc, i.e. two orders of magnitude larger 
than the Jeans scale, are nevertheless dominated by projected small-scale 3D power. Hence phase 
angle differences between all modes of quasar pair spectra actually probe the shape of the 3D power 
spectrum on scales comparable to the pair separation. We show that this new method for measuring 
the Jeans scale is unbiased and is insensitive to a battery of systematics that typically plague Lya 
forest measurements, such as continuum fitting errors, imprecise knowledge of the noise level and/or 
spectral resolution, and metal-line absorption. 

Subject headings: cosmology: large-scale structure - quasars: absorption lines - intergalactic medium 
- reionization 



1. INTRODUCTION 

The imprint of redshifted Lyman-a (Lya) forest ab- 
sorption on the spectra of distant quasars provides an 
exquisitely sensitive probe of the distribution of baryons 
in the intergalactic medium (IGM) at large cosmolog- 
ical lookback times. Among the remarkable achieve- 
ments of modern cosmology is the ability of cosmolog- 
ical hydrodynamical simulations to explain the origin 
of this absorption pattern, and reproduce i ts statisti- 
cal properties to percent level accuracy (e.g. iCen et al.l 
[l99l iMiralda-Escude et"al|[l99l [Rauchlll998[ ). But the 
wealth of information which can be gathered from the 
Lya forest is far from being exhausted. The thermal 
state of the baryons in the IGM refiects the integrated 
energy balance of heating — due to the collapse of cos- 
mic structures, radiation, and possibly other exotic heat 
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sources — and cooling due to the expansion of the Uni- 
verse (e.g. IMiralda-Escude fc Ree^ '1994"; IHui fc GnedinI 



llQQTtlHui fc Haiman 2003; Mciksin 2009). Cosmologists 
still do not understand how the interplay of these physi- 
cal processes sets the thermal state of the IGM, nor has 
this thermal state been precisely measured. 

There is ample observational evidence that ultravi- 
olet radiation emitted by the first star-forming galax- 
ies ended the 'cosmic dark ages' ionizing hydrogen and 
singly ionizing helium at z ^ 10 (e.g. Barkana &: Loea 
ICiardi fc Ferraral 120051 : iFan et al.l 120061 : iZarouM 
A second and analogous reionization episode 

is belie ved to have occurred at later times z ~ 

3-4 (iMadau fc MeiksinI 11994 iJakobsen et al.l 119941: 



2001 



20131) 



iReimers et al.l Il997t iCroft et aLl (l997| ). when quasars 
were sufficiently abundant to supply the hard photons 
necessary to doubly ionized helium. The most re- 
cent observations from HST/COS provide tentative ev- 
idence for an extended He II reionization from z ~ 
2.7 - 4 (IShull et all I2OIOI: iFurlanetto fc DixonI I2OIOI: 
IWorsecket al.l 120111 Worseck et al. 2013, m prepara- 
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tion), with a duration of ~ 1 Gyr, longer than naively ex- 
pected. Cosmic reionization events are watersheds in the 
thermal history of the Universe, photoheating the IGM 
to tens of thousands of degrees. Because cooling times 
in the rarefied IGM gas arc long, memory of this heating 
is retained ( Miralda -Escude & Rccs 1 994c iHui fc GnedinI 
1997; Hachnch & Steinmetz 1998; H ui fc Hai man 2003; 
Theuns et al.. .200281 11}). Thus an empirical characteriza- 



tion of the IGMs thermal history constrains the nature 
and timing of reionization. 

From a theoretical perspective, the impact of reion- 
ization events on the thermal state of the IGM is 
poorly understoo d. Radiative tran sf er simulations of 
both hydrogen (iBolton et all 120041: [Iliev et all 120061: 



Tittlev & Meiksin"2007a') and h elium (lAb el k H aehneltl 
1999; McQuinn et al. 2009; Mci ksin fc Ti ttlev 20ll) re- 
veal that the heat injection and the resulting temper- 
ature evolution of the IGM depends on the details of 
how and when reionization occurred. There is evi- 
dence that the thermal vestiges of H I reionization 
heating may persist until as late as z ~ 4 — 5, and 
thus be observable in the Lya forest (iHui fc HaimanI 
|2003; Furlanctto & Oh 2009; CeneEiD 120091) , whereas 
for Hell reionization at z ^ 3, the Lya forest is ob- 
servable over the full duration of the phase transition. 
Finally, other processes could inject heat into the IGM 
and impact its thermal state, such as the large-scale 
structure shocks which eventually pro duce the Warm 
Hot Ir itergalactic Mediu ni (WIIIM;e.g. iCen fc Ostriked 
119991 : iDave et a ll 119 991 [2001.1 . hea ti ng from galac 



tic o utflows (KoUmci er et al.l 120061: iCen fc Ostriker 



2006 ). photoelectric heat i ng of dust grains ()Nath et al' 
1991 llnoue fc Kamaval [2003[) . cosmic-ray heating 



Nath fc BiermannI 1199 3V Compton- heating from the 
hard X-ray bac kground (Madau fc Efstathiou 199£|^ X- 
ray preheating (Ricotti et al. 2005; Tanaka ct al. 2012a), 
or blazar heating ( i Broderick et al.. ,2012i : .Chang et alj 



120121: iPfrommer et al.ll2012l : iPuchwein et aLll2012^ . Pre- 
cise constraints on the thermal state of the IGM would 
help determine the relative importance of photoheating 
from reionization and these more exotic mechanisms. 

Despite all the successes of our current model of the 
IGM, precise constraints on its thermal state and con- 
comitant constraints on reionization (and other exotic 
heat sources) remain elusive. Attempts to character- 
ize the IGM thermal state from Lya forest measure- 
ments have a long history. In the simplest picture, 
the gas in the IGM obeys a power law temperature- 
density relation T = Tq(p/ p)'^~^ , which arises from 
the balance between photoionizat ion heating, and coo l- 
ing due to adiabatic expansion (|Hui fc GnedinI [19971) . 
The standard approach has been to compare mea- 
surements of various statistics of the Lya forest to 
cosmological hydrodynamical simulations. Leveraging 
the dependence of these statistics on the underlying 
temperature-density relation, its slope and amplitude 
(To, 7) parameters can be constrained. To this end a 
wide variety of stati stics have been employe d, such as 
the p ower spectrum (jZaldarriaga et al. 2001; iViel et aP 
|2009| ) or analogous sta tistics quantifying the small-scale 



power like wavelets (iTheuns et alT l2002bl: iLidz et al" 



20091 iGarzilh et al.|[2012D or the cu rvature dBecker et al 



20111). The flux PDF ([.McDonald et al.| 



Kim et al.l [2007t IBolton et all [2008; .Calura et al.l 



2000; 



2012 



iGarzilli et al. [2012[) and the shape of the 6-parameter 
distri b ution (iHaehnelt fc Steinmet3ll998t I Theuns et al.l 



2000t iRicotti et al .1 '2000!; 'Bryan fc Macha cekl 120001: 



Schave et al l 120001: (McD onald ct al. 2001; Th euns et all 
2002al: iRudie et al.l r [2012 ) have also been considered. 



Multiple statistics h ave also been combin ed such as the 
PDF and wavelet s (IGarzilli et all I2012D. or PDF and 
power spectrum (jViel et al.l I2009D . Overall, the re- 
sults of such comparisons are rather puzzling. First, 
the IGM appears to be generally too h ot, bo th at low 
(z_~ 2) and high {z ~ 4) redshift (|Hui fc Haima3 
120031 ). In p articular, the high i nferred temperatures at 
z ~ 4 f e.g. iSchave et al.l 120001: lZalda.rriaga et al.l 120011: 



McDonald et al.l 120011: iTheuns et al.l l2002bl: ILidz et al l 
200! " suggest th at Hell was reionized at still higher 



redshift z > 4 (|Hui fc HaimanI l2003f ). possibly con- 
flicting with the late z ~ 2.7 reioni zation of Hell ob- 
served in HST/C O S spectra (Furlanetto fc DixonI 120101: 
Shull et all \20Wl IWorseck et all 120111: ISvohers et al.l 



2OI2I VV^seck et al. 2013, in preparation). Sec- 



ond, IBolton et all (|2008l ) considered the PDF of high- 
resolution quasar spectra and concluded that, at z ~ 3 
the slope of the temperature-density relation 7 is either 
close to isothermal (7 — 1) or even inverted (7 < 1), 
suggesting "that the voids in the IGM may be signifi- 
cantly hotter and the thermal state of the low-density 
IGM may be substantially more complex than is usually 
assumed." Although this result is corroborated by addi- 
tional work emp loying differen t statistics / methodologies 
(jViel et al.ll2roa: Calura et al.l[2C)12: Ga rzilh et al.l 1201 21 
but see Lee et al. 2012), radiative transfer simulations 
of Hell reionization cannot produce an isothermal or 
inverted slope, unless a population other than quasars 
reionized H cII (Bolton e t a"Lll2004l : iMcQuinn et al.ll2009l : 
iMeiksin fc Tittley 2012) , which would fly in the face of 
conventional wisdom. To summarize, despite nearly a 
decade of theoretical and observational work, published 
measurements of the thermal state of the IGM are still 
highly confusing, and concomitant constraints on reion- 
ization scenarios are thus hardly compelling. 

Fortunately, there is another important record of the 
thermal history of the Universe: the Jeans pressure 
smoothing scale. Although baryons in the IGM trace 
dark matter fluctuations on large Mpc scales, on smaller 
scales < 100 kpc, gas is pressure supported against 
gravitational collapse by its finite temperature. Anal- 
ogous to the classic Jeans argument, baryonic fiuctu- 
ations are suppressed relative to the pressureless dark 
matter (which can collapse) , and thus small-scal e power 
is 'filtered' from the IGM (jCnedin fc Huil [19981) . which 
explains why it is sometimes referred to as the filtering 
scale. Classically the comoving Jeans scale is defined 
as A J — ^JiTcl/Gpil + z), but in reality the amount of 
Jeans filtering is sensitive to both the instantaneous pres- 
sure and hence temperature of the IGM, as well as the 
temperature of the IGM in the past. This arises because 
fluctuations at earlier times expanded or failed to col- 
lapse depending on the IGM temperature at that epoch. 
Thus the Jeans scale reflects the competition between 
gravity and pressure integrated over the Universe's his- 
tory, and cannot be expressed as a mere deterministic 
function of the instantaneous thermal state. Heuristi- 
cally, this can be understood because reionization heat- 
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ing is expected to occur on the reionization timescales 
of several hundreds of Myr, whereas the baryons re- 
spond to this heating on the sound-crossing timescale 

Xy[c,{l + z)] ~ (Gp)~^/^ which at mean density is com- 
p arable to the H u bble t ime tH- 

iGnedin fc Hull (|1998| ) considered the behavior of the 
Jeans smoothing in linear theory, and derived an analyt- 
ical expression for the filtering scale Xj as a function of 
thermal history 



dt'a\t'){X"j{t')fx 



{D+{t') + 2H{t')D+{t')) , 2 



dt" 



(1) 



a2(t")' 



where D^{t) is the linear growth function at time t, 
a{t) is the scale factor, and H{t) the Hubble expansion 
rate. Although this simple linear approximation pro- 
vides intuition about the Jeans scale and its evolution, 
Fourier modes with wavelength comparable to the Jeans 
scale are already highly nonlinear at z ~ 3, and hence 
this simple linear pictures breaks down due to nonlinear 
mode-mode coupling effects. Thus given that we do not 
know the thermal history of the Universe, that we ex- 
pect significant heat injection from HeH reionization at 
z 3 — 4 concurrent with the epoch at which we observe 
the IGM, and that IGM modes comparable to the Jeans 
scale actually respond non-linearly to this unknown heat- 
ing, the true relationship between the Jeans scale and the 
temperature-density relation at a given epoch should be 
regarded as highly uncertain. 

Besides providing a thermal record of the IGM, the 
small-scale structure of baryons, as quantified by the 
Jeans scale, is a fundamental ingredient in models of 
reionization and galaxy formation. A critical quantity 



tor of the IGM C 



19991: iMiralda-Escude et al.l 



2009[ iHaardt fc Madaul 1201 



I 



njf (e.g. 


Madau et al. 


120001 


Pawlik et al. 


lEmberson et al.l 120131: 



McQuinn et al.li2011i) , because it determines the average 
number of recombinations per atom, or equivalently the 
total number of UV photons needed to keep the IGM 
ionized. The clumping and the Jeans scale are directly 
related. Specifically, 



C = l 



'IGM 



1 



d\nk ■ 



27r2 



(2) 



where CTjqj^j is the variance of the IGM density, and 
-PiGM(fc) is the 3D power spectrum of the baryons 
in the IGM. Given the shape of PiGuik), the inte- 
gral above is dominated by contributions from small- 
scales (high-Zc), and most important is the Jeans cut- 
off A J, which determines the maximum fc-mode fcj ~ 
1/Aj contributing. The small-scale structure of the 
IGM strongly influences the propagati on of cosmologica l 
ionization fronts during reionization (jlliev "eFan[200l . 
Furthermore, several numerical studies have revealed 
that the hydrodynamic response of the baryons in the 
IGM to impuls i ye reionization heatin g is significant (e.g. 
GnedinI l2000al: iHaiman et al.l [2OOII: iKuhlen fc Madaul 



200a ICiardi fc Salvaterrafl2007t IPawlik et al.l 120091 ^71^ 

dicating that a full treatment of the interplay between 
IGM small-scale structure and reionization history prob- 



ably requires coupled radiative transfer hydrodynamical 
simulations. 

Reionization heating also evaporates the baryons from 
low-mass halos o r prevents gas from col l apsing in them 
altogether (e.g. IBarkana fc Loebl 119991 : IDiikstra et al.l 
120041 ) ■ an effect typically modeled via a critic al mass, 
below which galax i es cannot form (|Gnedi n 20003 



iBuUock et al.' I 2000t iBen son et al, 
2002; Kulkarni fcChoudhurv 20111 



Somcrville 



2002bllal: . 

[Gnedin (2000b|) 
used hydrodynamical simulations to show that this scale 
is well approximated by the filtering mass, which is the 
mass-scale corresponding to the Jea ns filtering length , 
i.e. Mf(z) = 47rp AV3 (see also iHoeft et all 120061: 
lOkamoto et al.l 120081 ). Finally, because the Jeans scale 
has memory of the thermal events in the IGM (see 
eqn. [T]), its value at later times can potentially con- 
strain models of early IGM preheating. In this scenario, 
heat is globally injected into the IGM at high-redshift 
z ~ 5 — 15 from blast- waves pr oduced by outfiows frorn 
proto-galaxies or miniquasars (IWtl ri99a IMadaul I2OOOI: 
[Madau et al. 2001; Ccn fc Bryan 2001; (Theuns et al l 



2001; Benson fc Madar u2003i; iScannapieco et al.l 1200'! 
Scannapicc o fc Oh .2001) X -ray radiation from e arly 



miniquasars (|Tanaka et all I2012bl: IParsons et all [2013) , 
which sets an entropy floor in the IGM and the raises fll- 
tering mass scale inhibiting the formation of early galax- 
ies. 

A rough estimate of the filtering scale at z = 3 
can be obtained from eqn. ([1]) and the following sim- 
plified assumptions; the temperature at z = 3 is 
T(z = 3) ~ 15000 K as suggested by measurements (e.g. 
Schave et alll2000l:lRicotti et al.ll2000l: iZaldarriaga et al.l 



200lHLidz et al.ll2009D . temperature evolves as T oc 1 -I- z 
the typical overdensity prob ed by the z = 3 Lya for- 
est is (5 ~ 2 (Becker et alj 120111 ). One then obtains 
Aj(z = 3) ~ 340 kpc (comoving), smaller than the clas- 
sical or instantaneous Jeans scale Aj by a factor of 3. 
This distance maps to a velocity interval vj — HaXj w 
26 km s~^ along the line of sight due to Hubble expan- 
sion. Thermal Doppler broadening gives rise to a cutoff 
in the longitudinal power spectrum, which occurs at a 
comparable velocity vth ~ 11.3 km s"""^, for gas heated to 
the same temperature. The similarity of the character- 
istic scale of 3D Jeans pressure smoothing and the ID 
thermal Doppler smoothing suggests that disentangling 
the two effects will be challenging given purely longi- 
tudinal obser v ations of the Lya forest, as confirmed by 
IPeeples et al.l ()2009aD . who considered the relative im- 
pact of thermal broadening and pressure smoothing on 
various statistics applied to longitudinal Lya forest spec- 
tra. Previous work that has aimed to measure thermal 
parameters such as Tq and 7 from Lya forest spectra, 
have largely ignored the degeneracy of the Jeans scale 
with these thermal parameters. The standard approach 
has been to assume values of the Jeans scale from a hy- 
drodynamical simulation (e.g. iLidz et al.l l2009; Viel et al.l 
l2009t [Beck er et al.l I20TI . which as per the discussion 
above, is equivalent to assuming perfect knowledge of the 
IGM thermal history. Because of the degeneracy with the 
Jeans scale, it is thus likely that previous measurements 
of the thermal parameters Tq and 7 are significantly bi- 
ased, and their error bars signiflcantly underestimated, 
if indeed Jeans scale takes on values different from those 
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assumed (but see Zaldarriaga et al. 2001 who marginal- 
ized over the Jeans scale, and Becker et al. 2011 who 
also considered its impact). We will investigate such de- 
generacies in detail in this paper with respect to power- 
spectra, and we consider degeneracies for a broader range 
of IGM statistics in a future work (A.Rorai et al. 2013, 
in preparation). 

The Jeans filtering scale can be directly measured using 
close quasar pair sightlines which have comparable trans- 
verse separations r± < 300 kpc (comoving; A9 < 40" 
at 2; = 3). The observable signature of Jeans smooth- 
ing is increasingly coherent absorption between spec- 
tra at_2rogressiyelY_smaller pair separations resolving 
it (jPeeples et al.l l2009b[ ). The idea of using pairs to 
constrain the small scale structure of the IGM is not 
new. However, all previous measurements have either 
focused on lensed quasars, which probe ext remely small 
transverse distances r± ^ 1 kpc <^ A r fe.g. lYoung et aTl 



198lHMcGil]| 



Ranch et al.1 



19901: iPetrv et al.lll998l:ISmette et al 



ig et al.l 
il.1 119951: 



2001[ ) such that the Lya forest is essen- 



tially perfectly coherent, or real physical quas ar pairs 
with r_L ~ 1 Mpc > Xj (jP'Odorico et al.l 120061 ) far too 
large to place useful constraints on the Jeans scale. Ob- 
servationally, the breakthrough enabling a measurement 
of the Jeans scale is the dis covery o f a large number of 
close quasar pairs (iHennawii 2004: H ennawi et al.ll2006bl : 
iMvers et al.] 120081: iHennawi et al.i [200^ with ~ 100 kpc 
separations. By appl y ing machine learn ing techniques 
(jRichards et al.l [200I IBovv ct al. 2011, 2012) to the 
Sloan Digital Sky Survey (SDSS; York e t al. 2000.) imag- 
ing, a sample of ^ 300 close r± < 700 k pc quasar pairs 
at 1.6 < z < 4.j3 ha s been uncovered ( Hennawi 120041 : 
IHennawi et al]|2006bl |2009) . 

In this paper we introduce a new method which will 
enable the first determination of the Jeans scale, and we 
estimate the precision with which it can be measured 
from this close quasar pair dataset. We explicitly con- 
sider degeneracies between the canonical thermal param- 
eters To and 7, and the Jeans scale Aj, which have been 
heretofore largely ignored. To this end, we use an ap- 
proximate model of the Lya forest based on dark mat- 
ter only simulations, allowing us to independently vary 
all thermal parameters and simulate a large parameter 
space. The structure of this paper is as follows: we de- 
scribe how we compute the Lya forest flux transmission 
from dark matter simulations, and our parametrization 
of the thermal state of the IGM in section § [2l In § [3] 
we consider thermal parameter degeneracies which re- 
sult when only longitudinal observations are available, 
and we show how the additional transverse information 
provided by quasar pairs can break them. In § |4]we intro- 
duce our new method to quantify absorption coherence 
using the difference in phase between homologous longi- 
tudinal Fourier modes of each member of a quasar pair. 
We focus on the probability distribution function (PDF) 
of these phase differences, and find that the shape of this 
phase PDF is very sensitive to the Jeans smoothing. A 
Bayesian likelihood formalism that uses the phase angle 
PDF to determine the Jeans scale is presented in § [5l 
Our Bayesian method allows us to combine the Jeans 
scale information with other Lya forest statistics such 

* The lower redshift limit is corresponds to Lya forest absorption 
being above the atmospheric cutoff. 



as the longitudinal power spectrum, and we conduct a 
Markov Chain Monte Carlo (MCMC) analysis in this sec- 
tion to determine the resulting precision on Tp, 7, and 
Aj expected for realistic datasets, explore parameter de- 
generacies, and study the impact of noise and systematic 
errors. We conclude and summarize in § [71 

Throughout this paper we use the ACDM cosmological 
model with the parameters fim = 0.28, f^A = 0.72, h — 
0.70, n — 0.96, as = 0.82. All distances quoted are in 
comoving kpc. 

2. SIMULATION METHOD 
2.1. Dark Matter Simulation 

Our model of the Lya forest is based on a single snap- 
shot of a dark matter only simulation at z = 3. In 
this scheme, the dark matter simula tion pro vides the 
dark matter density an d velocity field (|Croft et al. 199^ 
iMeiksin fc Whitell200H ) , and the gas density and temper- 
ature are computed using simple scaling relations moti- 
yated by the results of full hydrodynamical simulations 
(IHui fc Gnedinl[l997l: l&iedin fc Huil 119981: IGnedin et al.l 
2003). Our objective is then to explore the sensitiv- 
ity with which close quasar pairs can be used to con- 
strain the thermal parameters defining these scaling re- 
lations, and in particular the Jeans scale. To this end, 
we require a dense sampling of the thermal parameter 
space, which is computationally feasible with our semi- 
analytical method applied to a dark matter simulation 
snapshot, whereas it would be extremely challenging to 
simulate such a dense grid with full hydrodynamical sim- 
ulations. We do not model the redshift evolution of the 
IGM, nor do we consider the effect of uncertainties on 
the cosmological parameters, as they are constrained by 
various large-scale structure and CMB measurements to 
much higher precision than the thermal parameters gov- 
erning the IGM. 

We used an up dated v e rsion version of the TreePM 
code described in iWhitd ()2002|) to evolve 1500^ equal 
mass (3 X 10^ /i-^Mq) particles in a periodic cube of 
side length Lbox = 50/i~^Mpc with a Plummer equiv- 
alent smoothing of 1.2/i~^kpc. The initial conditions 
were generated by displacing particles from a regular 
grid using second order Lagrangian perturbation theory 
at z = 150. This TreePM code has been compared to 
a number of other codes and h as been shown to per - 
form well for such simulations (jHeitmann et al.l l2008fl . 
Recently the code has been modified to use a hybrid 
MPI-|-OpenMP approach which is particularly efficient 
for modern clusters. 



2.2. Description of the Intergalactic Medium 

The baryon density field is obtained by smoothing the 
dark matter distribution; this smoothing mimics the ef- 
fect of the Jeans pressure smoothing. For any given ther- 
mal model, we adopt a constant filtering scale Aj, rather 
than computing it as a function of the temperature, and 
this value is allowed to vary as a free parameter (see 
discussion below). The dark matter distribution is con- 
volved with a window function Wigm, which, in Fourier 
space, has the effect of quenching high-fc modes 



SiGuik) = WiGuik, Xj)6BM{k) 



(3) 
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For example a Gaussian kernel with a — Aj, WiGM(fc) = 
exp(— fc^Aj/2), would truncates the 3D power spectrum 
at fc ~ 1/Aj. 

Because we smooth the dark matter particle distribu- 
tion in real-space, it is more convenient to adopt a func- 
tion with a finite-support 

8\Qyy{x) <yi^^m,iK(\x - Xi\,Rj) (4) 

i 

where and Xi are the mass and position of the particle 
i, K{r) is the kernel, and i?j the smoothing parameter 
which sets the Jeans scale. We adopt the followoing cubic 
spline kernel 




In the central regions the shape of K(r) very closely re- 
sembles a Gaussian with a ~ Rj /3.25, and we will hence- 
forth take this Rj jZHh to be our definition of Aj, which 
we will alternatively refer to as the 'Jeans scale' or the 
'filtering scale'. The analogous smoothing procedure is 
also applied to the particle velocities; however, note that 
the velocity field has very little small-scale power, and so 
the velocity distribution is essentially unaffected by this 
pressure smoothing operation. As we discuss further in 
Appendix [Aj the mean inter-particle separation of our 

1/3 

simulation cube (5/ = ibox/A^p sets the minimum Jean 
smoothing that we can resolve with our dark matter sim- 
ulation, hence we can safely model values of Aj > 50 kpc. 

At the densities typically probed by the Lya forest, the 
IGM is governed by relatively simple physics. Most of 
the gas has never been shock heated, is optically thin to 
ionizing radiation, and can be considered to be in ioniza- 
tion equilibrium with a uniform UV background. Under 
these conditions, the competition between photoioniza- 
tion heating and adiabatic expansion cooling gives rise to 
a tight relation between temperature and density which is 
well approximated by a power law (|Hui fc Gnedidll997l ). 

T(<5)=ro(l + 5)^-1 (6) 

where Tq, the temperature at the mean density, and 7, 
the slope of the temperature-density relation, both de- 
pend on the thermal history of the gas. We thus fol- 
low the standard approach, and parametrize the thermal 
state of the IGM in this way. Typical values for Tq are on 
the order of 10'* K, while 7 is expected to be around unity, 
and asymptotically approach the value of 700 = 1-6, if 
there is no other heat injection besides (optically thin) 
photoionzation heating. Recent work suggests that an 
inverted temperature-density relation 7 < 1 provides a 
better match to the flu x probability distribution of the 
Lya forest (B olton et al . 2008) , b ut the rob ustness of this 
measurement has been debated (|Leell2012[ ). 

The optical depth for Lya absorption is proportional 
to the density of neutral hydrogen n///, which, if the gas 
is highly ionized (xm 1) a nd in photoi onization equi- 
librium, can be calculated as ()Gunn fc Peterson igSS) 

nm = aiT)njj/r (7) 



where F is the photoionization rate due to a uniform 
metagalactic ultraviolet background (UVB), and a{T) is 
the recombination coefficient which scales as T~°-^ at 
typical IGM temperatures. These approximations re- 
sult in a power law relation between Lya optical depth 
and overdensity often referred as the fiuctuating Gunn- 
Petersonn approximation (FGPA) r oc (1 + 5)2-0.7(7-1)^ 
which does not include the effect of peculiar motions and 
thermal broadening. We compute the observed optical 
depth in redshift-space via the following convolution of 
the real-space optical depth 

/oo 
T{x)^{Hax + Wp.ii (x) - V, b{x))dx, (8) 
-00 

where Hax is the real-space position in velocity units, 
t;pj|(x) is the longitudinal component of the peculiar ve- 
locity of the IGM at location x, and $ is the normalized 
Voigt profile (which we approximate with a Gaussian) 

characterized by the thermal width b = \J 2K bT j rnc? , 
where we compute the temperature from the baryon den- 
sity via the temperature-density relation (see eqn. [5]). 
The observed flux transmission is then given by F{v) = 

We apply the aforementioned recipe to 2x 100^ lines-of- 
sight (skewers) running parallel to the box axes, to gen- 
erate the spectra of 100^ quasar pairs, and we repeat this 
procedure for 500 different choices of the parameter set 
(To, 7, Aj). Half of the spectra (the first member of each 
pair) are positioned on a regular grid in the y—z plane, in 
order to distribute them evenly in space. Subsequently, 
a companion is assigned to each of them, and our choice 
for the distribution of radial distances warrants further 
discussion. Our goal is to statistically characterize the 
coherence of pairs of spectra as a function of impact pa- 
rameter, and near the Jeans scale this coherence varies 
rapidly with pair separation. Hence computing statistics 
in bins of transverse separation is undesirable, because 
it can lead to subtle biases in our parameter determi- 
nations if the bins are too broad. To circumvent these 
difficulties, we focus our entire analysis on 30 linearly- 
spaced discrete pair separations between and 714 kpc. 
For each of the 100^ lines-of-sight on the regular grid, 
a companion sightline is chosen at one of these discrete 
radial separations, where the azimuthal angle is drawn 
from a uniform distribution. 

We follow the standard approach, and treat the meta- 
galactic photoionization rate F as a free parameter, 
whose value is fixed a posteriori by requiring the mean 
fiux of our Lya s kewers (exp(— r)) t o match the mea- 
sured values from iFaucher-Giguere e t al. (2007). This 
amounts to a simple constant re-scaling of the optical 
depth. The value of the mean fiux at z = 3 is taken 
to be fixed, and thus assumed to be known with infi- 
nite precision. This is justified, because in practice, the 
relative measurement errors on the mean flux are very 
small in comparison to uncertainties of the thermal pa- 
rameters we wish to study. In a future work, we conduct 
a full parameter study using other Lya forest statistics, 
and explore the effect of uncertainties of the mean flux 
(A.Rorai et al. 2013, in preparation). Examples of our 
spectra are shown in Figure [TJ 

To summarize, our models of the Lya forest are 
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Fig. 1. — An example of three simulated spectra. The left and the right panels represent the same spectra in the simulation calculated 

for two models with different Jeans smoothing length Aj. The middle and the lower panel represent two spectra respectively at separation 

0.5 Mpc and 1 Mpc from the top one. The coloured sine curves track homologous Fourier modes in each spectrum, with rescaled mean 

and amplitude to fit the range [0, 1]. The wave shifts provide a graphical visualization of phase differences, which we will use to quantify 

spectral coherence and probe the Jeans scale. The right panels suggest that a larger Xj results in greater spectral coherence and generally 

smaller phase differences between neighboring sightlines. 

uniquely described by the three parameters (Tb,7, Aj), 
and we reiterate that these three parameters are consid- 
ered to be independent. In particular the Jeans scale is 
not related to the instantaneous temperature at mean 
density Tq. Although this may at first appear unphys- 
ical, it is motivated by the fact that Xj depends non- 
linearly on the entire thermal history of the IGM (see 
eqn. [Ij, and both this dependence and the thermal his- 
tory are not well understood, as discussed in the in- 
troduction. Allowing Aj to vary independently is the 
most straightforward parametrization of our ignorance. 
However, improvements in our theoretical understand- 
ing of the relationship between Aj and the thermal his- 
tory of the IGM (TqiT) could inform more intelligent 
parametrizations. Furthermore, inter-dependencies be- 
tween thermal parameters can also be trivially included 
into our Bayesian methodology for estimating the Jeans 
scale as conditional priors, e.g. P{Xj,Tq), in the param- 
eter space. 



3. POWER SPECTRA AND THEIR DEGENERACIES 

Although many different statistics have been em- 
ployed to isolate and constrain the thermal informa- 
tion contained in Lya forest spectra, the fiux prob- 
ability density function (PDF; 1-point function) and 
the flux power spectrum or auto-correlation function 
(2-point function), are among the most common(e.g. 



McDonald et al."2000l:' Zaldarriaga et al.ll2001l:lKim et ahl 
2007; Vicl ct al. 2009). But because the Lya transmis- 
sion F is significantly non-Gaussian, significant infor- 
mation is also contained in higher-order statistics. For 
example wavelet decompositions, which contains a hy- 
brid of real-space and Fourier-space information, have 
been a dvocated for measuring spatial temperature fluctu- 
ation s (|Lidz et al.ll2009t lZaldarriagall2002t iGarzilh et all 
|2012() . Several studies have focused on the on the b- 



mal parameters (iRicotti et al.|[2000t iSchave et al.|[2000l: 
[McDonald ct al.' '20011: iRudie et al.M2012D . and recentlv 
iBecker ct al. (2011) introduced a 'curvature' statistic as 
an alternative measure of spectral smoothness to the 
power spectrum. 

As gas pressure acts to smooth the baryon density field 
in 3D, it is natural explore power spectra as a means to 
constrain the Jeans filtering scale. A major motivation 
for working in Fourier space, as opposed to the real-space 
auto-correlation function, is that it is much easier to deal 
with limited spectral resolution in Fourier space. The 
vast majority of close quasar pairs are too faint to be 
observed at echelle resolution FWHM ~ 5kms^^ where 
the Lya forest is completely resolved. Instead, spectral 
resolution has to be explicitly taken into account. But to 
a very good approximation the smoothing caused by lim- 
ited spectral resolution simply low-pass filters the fiux, 
and thus the shape of the fiux power spectrum is un- 
changed for fc-modes less than the spectral resolution 
cutoff fcrcs- Thus by working in fc-space, one can sim- 
ply ignore modes k > fcres and thus obviate the need 
to precisely model the spectral resolution, which can be 
challenging for slit-spectra. Finally, another advantage 
to fc-space is that, because fiuctuations in the IGM are 
only mildly non-linear, some of the desirable features of 
Gaussian random fields, such as the statistical indepen- 
dence of Fourier modes, are approximately retained, sim- 
plifying error analysis. In what follows we consider the 
impact of Jeans smoothing on longitudinal power spec- 
trum, as well as the simplest 2-point function that can be 
computed from quasar pairs, the cross-power spectrum. 

3.1. The Longitudinal Power Spectrum 

It is well known that the shape of the longitudi- 
nal power spectrum, and the high-A; thermal cutoff 
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Fig. 2. — Left panel: The ID dimensionless power spectrum of the Lyo forest at 2 = 3. In our large grid of thermal models, we can identify 
two very different parameter combinations, represented by the solid (blue) and dashed (green) curves, which provide an equally good fit to 
the longitudinal power spectrum measurements from(McDonald et al. (2000) (red squares) and Croft ct al. (2002) (cyan circles), illustrating 
the strong degeneracies between these parameters (To,7,Aj). In light of those degeneracies, it is clear that it would bo extremely challenging 
to constrain these parameters with the longitudinal power alone. Right panel: The dimensionless cross power spectrum n(k;r±)k/TT (solid 
line) at A; Si 0.05 s/km from our simulated skewers, as a function of for the same two thermal models shown at left, with error bars 
estimated from a sample of 20 pairs. The degeneracy afflicting the ID power is broken using the new information provided by close quasar 
pairs, because the different Jeans scales result in differing amounts of transverse spectral coherence, providing much better prospects for 
measuring Aj. We also show the cross modulus {pi{k)p2{k))k/iT (dashed lines) for the same two models, which show flat variation with 
rx, and a very weak dependence on the Jeans scale. Most of the information about the 3D Jeans smoothing resides not in the moduli, but 
rather in the phase differences between homologous modes (see discussion in S I4.3| l. 



in partic ular, can be used constrain t he Tq and 7 
(|Zaldarri aga et alJ 120011 : iViel et all l2009f ). This cutoff 
arises because thermal broadening smooths r in redshift- 
space (e.g. eqn.[8]). In contrast to this ID smoothing, the 
Jeans filtering smooths the IGM in 3D, and it is exactly 
this confluence between ID and 3D smoo thing that we 
want to understand (see also lPeeples et al . 2G09a,b). We 
consider the quantity 5F{v) — {F — F) / F , where F is the 
mean transmitted flux, and compute the power spectrum 
according to 

{\SHk)\'). 



P{k) 



(9) 



where SF{k) denotes the Fourier transform of 6F for lon- 
gitudinal wavenumber fc, and angular brackets denote an 
suitable ensemble average (i.e. over our full sample of 
spectra). 

In Figure [5] we compare two thermal models in our 
thermal parameter grid to measurements of the longi- 
tudinal power spectrum of the Lyg fore st at z ~ 3 
(|McDonald et all [2000t ICroft et all [200I) . The blue 
(solid) curve has a large Jeans scale Aj = 214 kpc, a 
cooler IGM Tq — 13, 000 K, and a nearly isothermal 
temperature-density relation 7 = 0.9, which is mildly 
inverted such that voids are hotter than overdensities. 
Such isothermal or even inverted equations of state 
could arise at z ~ 3 from He II reionization heating 
(|McQuinn et al.ll2009HTittlev fc M ciksin 2007b), and re- 
cent analyses of the flux PDF (IBolton et al.ll20p8[) as well 
joint analysis of PDF and power-spectrum (|Viel et al.l 
l2009tlCalura et al.llMllGarzilh et al.ll2012f ) have argued 
for inverted or nearly isothermal values of 7. The green 
(dashed) curves have a smaller Jeans scale Aj — 100 kpc, 
a hotter IGM Tq = 18, 000 K, and a steep 7 = 1.6 
temperature-density relation consistent with the asymp- 



totic value if the IGM has not undergone significant re- 
cent heating events (Hui & Gnodin 1997; Hui & Haima^ 
|2003| ) . Thus with regards to the longitudinal power spec- 
trum, the Jeans scale is clearly degenerate with the am- 
plitude and slope (10,7) of the temperature-density re- 
lation. One would clearly come to erroneous conclusions 
about the equation of state parameters {Tq,"/) from lon- 
gitudinal power spectrum measurements, if the lack of 
knowled ge of the Jeans scale i s not marginalized out 
(see e.g. iZaldarriaga et al.l[200ll for an example of this 
marginalization) . 

This degeneracy in the longitudinal power arises be- 
cause the Jeans filtering smooths the power in 3D on a 
scale which project to a longitudinal velocity 



H{z = 3) , 
1 + 3 



26 



A, 



340 kpc 



km s 



(10) 



resulting in a cutoff of the power at kj « 0.04 s km"~ 
(for the typical values assumed in the introductior0) . 
The thermal Doppler broadening of Lya absorption lines 
smooths the power in ID, on a scale governed by the h- 
parameter 



l 2kBT 



15.7 



T 



1.5 X 104 K 



1/2 



km s 



(11) 



which results in an analogous cutoff at fcth = \/2/ 
b « 0.09 s km^^ for a temperature of 15000 K. Above 
fc_B is the Boltzmann constant, nip the proton mass, and 

^ We caution that this estimate assumes a thermal history where 
T DC 1-1-2, without considering the effect of Hell reionization. 
In that case the deduced value for the filtering scale Xj would 
probably be smaller. 
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~ 0.59 is the mean molecular weight for a primordial, 
fully ionized gas. The fact that the two cutoff scales are 
comparable results in a strong degeneracy which is very 
challenging to disentangle with longitudinal observations 
alone. Similar degeneracies between the Jeans scale and 
(To, 7) exist if one considers wavelets, the curvature, the 
6-parameter distribution, and the flux PDF, which we ex- 
plore in an upcoming study (Rorai et al. 2013, in prep). 
In the next section we show that this degeneracy between 
3D and ID smoothing can be broken by exploiting addi- 
tional information in the transverse dimension provided 
by close quasar pairs. 

3.2. Cross Power Spectrum 

The foregoing discussion illustrates that the 3D (Jeans) 
and ID (thermal broadening) smoothing are mixed in 
the longitudinal power spectrum, and ideally one would 
measure the full 3D power spectrum to break this de- 
generacy. For an isotropic random field the ID power 
spectrum P{k) and the 3D power Psoik) are related ac- 
cording to 



P^n - ^T- 



1 1 dP{k) 



2tt k dk 



(12) 



However, in the Lya forest redshift-space distortions and 
thermal broadening result in an anisotropics that render 
this expression invalid. 

With close quasar pairs, transverse correlations mea- 
sured across the beam contain information about the 3D 
power, and can thus thus disentangle the 3D and ID 
smoothing. Consider for example the cross-power spec- 
trum 7r(A:, rj_) oftwo spectra 6Fi{v) and SF2{v) separated 
by a transverse distance r± 



TT{k;r^)^n[SF*{k)6F2{k)]. 



(13) 



When r± then dF2 — > 6F1 and the cross-power 
tends to the longitudinal power P{k). The cross-power 
can be thought of as effectively a power spectrum in the 
longitudinal direction, and a c orrelation functi on in the 
transverse direction (see also iViel et all |2002|) . Alter- 
natively stated, the cross power provides a transverse 
distance dependent correction to the longitudinal power 
P{k), reducing it from its maximal value at 'zero lag' 
= 0. This further implies that measuring the cross 
power of closely separated and thus highly coherent spec- 
tra amounts to, at some level, a somewhat redundant 
measurement of the longitudinal power which could be 
simply deduced from isolated spectra. In the next sec- 
tion, we will explain how to isolate the genuine 3D infor- 
mation provided by close quasar pairs using a statistic 
that is more optimal than the cross-power. Nevertheless, 
Figure [5] shows the cross-power spectrum for the two de- 
generate models discussed in the previous section, clearly 
illustrating that even the sub-optimal cross-power spec- 
trum can break the strong degeneracies between thermal 
parameters that are present if one considers the longitu- 
dinal power alone. 

4. PHASE ANGLES AND THE JEANS SCALE 

Although the cross-power has the ability to break the 
degeneracy between 3D and ID smoothing present in the 
longitudinal power, we demonstrate here that the cross- 
power (or equivalently the cross-correlation function) is 



however not optimal, and indeed the genuine 3D infor- 
mation is encapsulated in the phase differences between 
homologous Fourier modes. 

4.1. Drawbacks of the Cross Power Spectrum 
Let us write the ID Fourier transform of the field 5F 



as 



5F{k) = p(fc)e'''('=) 



(14) 



where the complex Fourier coefficient is described by a 
modulus p and phase angle 0, both of which depend on 
k. Note that for any ensemble of spectra P(k) = (p^(fc)), 
hence the modulus p{k) is a random draw from a distri- 
bution whose variance is given by the power spectrum. 
From eqn. the cross-power of the two spectra 5Fi {v) 
and (5^2 (u) is then 



7ri2(fc) = pi(fc)p2(fc)cos(6'i2(fc)). 



(15) 



where 6i2{k) = 9i{k) — 6*2 (fc) is the phase difference be- 
tween the homologous fc— modes. The distribution of the 
moduli pi and p2 are also governed by P{k), but at small 
impact parameter they are not statistically independent 
because of spatial correlations. Nevertheless, the mod- 
uli contain primarily information already encapsulated 
in the longitudinal power, and are thus affected by the 
same thermal parameter degeneracies that we described 
in the previous section. For the purpose of constrain- 
ing the Jeans scale, we thus opt to ignore the moduli 
pi and p2 altogether, in an attempt to isolate the gen- 
uine 3D information, increasing sensitivity to the Jeans 
scale, while minimizing the impact of thermal broaden- 
ing, removing degeneracies with the temperature-density 
relation parameters (To,7). 

The foregoing points are clearly illustrated by the 
dashed curves in the right panel of Figure [21 which com- 
pares the quantity {pi{k)p2{k)) as a function of impact 
parameter r±^ for the same pair of thermal models dis- 
cussed in § 13.11 which are degenerate with respect to the 
longitudinal power. The similarity of these two curves re- 
flects the degeneracy of the longitudinal power for these 
two models, and one observes a flat trend with and 
a very weak dependence on the Jeans scale Aj, substan- 
tiating our argument that the moduli contain primarily 
ID information. 

As the moduli contain minimal information about the 
3D power, we are thus motivated to explore how the 
phase difference 9i2{k) can constrain the Jeans scale. In 
terms of Fourier coefficients, 9i2{k) can be written 



9i2{k) — arccos 



^[5Fl{k)5F2{k)] 
'\5F,{k)\^\5F2{k)\^ 



(16) 



Note that because the phase difference is given by a ratio 
of Fourier modes, it is completely insensitive to the nor- 
malization of 5F ^ and hence to quasar continuum fitting 
errors, provided that these errors do not add power on 
scales comparable to k. In the remainder of this section, 
we provide a statistical description of the distribution of 
phase differences and we explore the properties and de- 
pendencies of this distribution. To simplify notation we 
will omit the subscript and henceforth denote the phase 
difference as simply 9{k,r±) = 9i{k) —92(k), where r± is 
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Fig. 3. — Schematic representation of the heuristic argument 
used to determine the phase difference distribution; phase are 
determined by density filaments crossing the lines of sight of 
two quasars. If the orientation of the filaments i/j is isotropi- 
cally distributed then 9' , dependent on the longitudinal distance 
L = rx tan ip, follows a Cauchy distribution. 

the transverse distance between the two spectra 5Fi{v) 
and 5F2{v). 

4.2. An Analytical Form for the PDF of Phase 
Differences 

The phase difference between homologous /c-modes is a 
random variable in the domain [— vr, tt], which for a given 
thermal model, depends on two quantities: the longi- 
tudinal mode in question k and the transverse separa- 
tion rx- One might advocate computing the quantity 
(cos0(fc, rj,)) analogous to the cross-power (see eqn. [T3|) . 
or the mean phase difference {9{k,r±)), to quantify the 
coherence of quasar pair spectra. However, as we will 
see, the distribution of phase differences is not Gaussian, 
and hence is not fully described by its mean and vari- 
ance. This approach would thus fail to exploit all the 
information encoded in its shape. Our goal is then to de- 
termine the functional form of the distribution of phase 
differences at any (fc, rx), and relate this to the ther- 
mal parameters governing the IGM. This is a potentially 
daunting task, since it requires deriving a unique func- 
tion in the 2-dimensional space r±) for any location 
in our 3-dimensional thermal parameter grid (ro,7, Aj). 
Fortunately, we are able to reduce the complexity consid- 
erably by deriving a simple analytical form for the phase 
angle distribution. 

We arrive at a this analytical form via a simple heuris- 
tic argument, whose logic is more intuitive in real space. 
Along the same lines, we focus initially on the IGM den- 
sity distribution along ID skewers, and then later demon- 
strate that the same form also applies to the Lya flux 
transmission. Consider a filament of the cosmic web 
pierced by two quasar sightlines separated by r±, and 



oriented at an angle ip relative to the transverse direc- 
tion. A schematic representation is shown in Figure |3l 
This structure will result in two peaks in the density field 
along the two sightlines, separated by a longitudinal dis- 
tance of L = r± taniy9. If we assume that the positions of 
these density maxima dictate the position of wave crests 
in Fourier space, the phase difference for a mode with 
wave number k can be written as 9' — kL — kr± tamp. 
We can derive the probability distribution of the phase 
difference by requiring that p{9')d6' = p{ip)dip, and as- 
suming that, by symmetry, ip is uniformly distributed. 
This implies that 9' follows the Cauchy-distribution 



1 



1 



eT:l + {9'/e)^ 



(17) 



where e parametrizes the distribution's concentration. 
As a final step, we need to redefine the angles such that 
they reside in the proper domain. Because tamp spans 
the entire real line, so will 9'; however, for any integer n, 
all phases 6' + 27rn corresponding to distances L + 2TTn/k 
will map to identical values of 9, defined to be the phase 
difference in the domain [— tt, tt]. Redefining the domain, 
requires that we re-map our probabilities according to 



(18) 



a procedure known as 'wrapping' a distribution. Fortu- 
nately, the exact form of the wrapped-Cauchy distribu- 
tion is known: 



wc 



1 



27r 1 + - 2C cos(6i - ^) ^ 



(19) 



where /i = (9) is the mean value (in our case /i = by 
symmetry) , and ^ is a concentration parameter between 
and 1, which is the wrapped analog of e above. In the 
limit where C ~^ 1 the distribution tends to a Dirac delta 
function Sd{x), which is the behavior expected for identi- 
cal spectra. Conversely, C = results in a uniform distri- 
bution, the behavior expected for uncorrelated spectra. 
A negative C gives distributions peaked at = tt and is 
unphysical in this context. 

4.3. The Probability Distribution of Phase Differences 
of the IGM Density 

We now show that this wrapped-Cauchy form does a 
good job of describing the real distribution of phase dif- 
ferences for our simulated IGM density skewers. Note 
that for our simple heuristic example of randomly ori- 
ented filaments, the concentration parameter ^ only de- 
pends on the product of kr±; whereas, in the real IGM, 
one expects the spectral coherence quantified by C to de- 
pend on the Jeans scale Aj. Because we do not know 
how to directly compute the concentration parameter in 
terms of the Jeans scale from first principles, we opt to 
calculate ^ from our simulations. At any longitudinal 
wavenumber fc, pair separation r±, and Jeans scale Aj, 
our density skewers provide a discrete sampling of the 
9 distribu tion. We use the maximum likelihood proce- 
dure from lJammalamadaka &: Sengup ta (2001) to calcu- 
late the best-fit value of C from an ensemble of 9 values, 
as described further in Appendix [BJ Figure U shows the 
distribution of phases determined from our IGM den- 
sity skewers (symbols with error bars) compared to the 
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Fig. 4. — Phase difference probability functions of the density fields at different separations r±, wavenumbers k and Jeans scale Aj. 
Points with errorbars represent the binned phase distribution of the density field as obtained from the simulation, while the solid lines 
are the best-likelihood fit using a wrapped-Cauchy distribution. When the spectra are highly correlated the phases are small and the 
distribution is peaked around zero, whereas independent skewers result in fiat probability functions. The error are estimated from the 
number of modes available in the simulation, assuming a Poisson distribution. By symmetry p{0) must be even in 9, hence it is convenient 
to plot only the range [0,7r], summing positive and negative probabilities (clearly obtaining ) to increase the sampling in each bin. 

We express the scale of each mode both giving the wavelength A in Mpc and the wave number k (in s km~^) in the transformed velocity 
space. The wrapped-Cauchy function traces with good approximation the phase distribution obtained from the simulation, showing less 
accuracy in the cases of strongly concentrated peaks, where low- probability bins are noisy. Each color is a different smoothing length: 
Aj = 50, 100 and 200 kpc (respectively black, red and blue). It is important to notice that the relative distributions arc different not only 
at scales comparable to Aj, but also for larger modes, because the 3D power of high-fc modes when projected on a ID line contributes to all 
the low-fc components (see the text for a detailed discussion). Secondly, it is clear that the most relevant pairs are the closest {rj_ < Aj), 
because for wide separations the coherence is too low to get useful information. These two consideration together explain why close quasar 
pairs are the most effective objects to measure the Jeans scale, even if they cannot be observed at high resolution. 



best-fit wrapped-Cauchy distributions (curves) for differ- 
ent longitudinai modes k, transverse separations r±, and 
values of the Jeans scale Aj. We see that the wrapped- 
Cauchy distribution typically provides a good fit to the 
simulation data points to within the precision indicated 
by the error bars. For very peaked distributions which 
correspond to more spectral coherence (i.e. low-k or large 
Aj), there is a tendency for our wrapped-Cauchy fits to 
overestimate the probability of large phase differences 
relative to the simulated data, although our measure- 
ments of the probability are very noisy in this regime. 
We have visually inspected similar curves for the entire 
dynamic range of the relevant fc, rj_ and Aj, for which the 



shape of the wrapped-Cauchy distribution varies from 
nearly uniform ~ 0) to a very high degree of coher- 
ence (C — 1), and find similarly good agreement. 

It is instructive to discuss the primary dependencies of 
the phase difference distribution on wavenumber k, sep- 
aration rj_, and the Jeans scale Aj illustrated in Figure 
[3) At a fixed wavenumber fc, a large separation relative 
to the Jeans scale results in a flatter distribution of 6, 
which approaches uniformity for r_L 3> A j . The distribu- 
tion approaches the fully coherent limit of a Dirac delta 
function for r± <^ Xj, and the transitions from a strongly 
peaked distribution to a uniform one occurs when r± is 
comparable to the Jeans scale Aj. We see that quasar 
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pairs with transverse separations r± < 3Aj, contain in- 
formation about the Jeans scale, whereas this sensitivity 
vanishes for larger impact parameters. At fixed rj_, lower 
fc-modes (i.e. larger scales) are more highly correlated 
(smaller 9 values) as expected, because sightlines spaced 
closely relative to the wavelength of the mode kr± ^ 1, 
probe essentially the same large scale density fluctuation. 
Overall, the dependencies in Figure|4]illustrate that there 
is information about the Jeans smoothing spread out over 
a large range of longitudinal fc-modes. Somewhat surpris- 
ingly, even modes corresponding to wavelengths > 100 
times larger than Aj can potentially constrain the Jean 
smoothing. 

This sensitivity of very large-scale longitudinal fc- 
modes to a much smaller scale cutoff Aj in the 3D power 
merits further discussion. First, note that the range 
of wavenumbers typically probed by longitudinal power 
spectra of the Lya forest lie in the range 0.005 s km~^ < 
fc < G.ls km^^ (see Figured]), corresponding to modes 
with wavelengths 60 km s~^ < v < 1250 km s~^ or 
830 kpc < A < 17Mpc. Here the low-fc cutoff is set by 
systema tics relate d to determining the quasar continuum 
(see e.g. lLeeir2012[ ). whereas the high-fc cutoff is adopted 
to mitigate contaminati on of the small-scale p ower from 
metal absorption lines ([McDonald et al.l 120001 ). In prin- 
ciple high-resolution (echelle) spectra FWHM= 5 km s~^ 
probe even higher wavenumbers as large as fc ~ 3, how- 
ever standard practice is to only consider fc < 0.1 in 
model- fitting (see e.g. iZaldarriaga et al.l |2001[ ). Thus 
even the highest fc-modes at our disposable fc ~ 0.1 corre- 
spond to wavelengths ~ 830 kpc significantly larger than 
our expectation for the Jeans scale ~ 100 kpc. Further- 
more, we saw in § 13.11 that degenerate combinations of 
the Jeans smoothing and the IGM temperature-density 
relation can produce the same small-scale cutoff in the 
longitudinal power. Thus both metal-line contamination 
and degeneracies with thermal broadening imply that 
while it is extremely challenging to resolve the Jeans scale 
spectrally, the great advantage of close quasar pairs is 
that they resolve the Jeans scale spatially, provided they 
have transverse separations r± comparable to Aj. We 
will thus typically be working in the regime where fc/ 
fc_L 1, where we define k± = xo/aHr^, where aHr± 
is the transverse separation converted to a velocity and 
xq = 2.4048 is a constant the choice of which will become 
clear below. 

In this regime, it is straightforward to understand why 
the phase differences between large-scale modes are nev- 
ertheless sensitive to the Jeans scale. Consider the quan- 
tity {cosd{k,r±)) , which is related to the cross-power 
discussed in § 14 II This 'moment' of the phase angle 
PDF can be written 



(cos 9{k, r±)) 



P{e{k,r^))cos0{k,r^)de, (20) 



which tends toward zero for totally uncorrelated spectra 
{P{d) = l/27r) and towards unity for perfectly correlated, 
i.e. identical spectra {P{d) — ^d{S)) spectra. Following 
the discussion in § 14.11 we can write 

7r(fc,rx) = (pi(fc)p2(fc)cos0(fc,rj.))« (21) 
(pi(fc)p2(/e))(cos0(fc, r^)) « P(fc)(cos^(fc, r±)), 



where the first approximation is a consequence of the 
approximate Gaussianity of the density fiuctuations, and 
the second from the fact that {P1P2) ~ P(k) for fc/fcj_ ^ 
1, as demonstrated by the dashed curves in the right 
panel of Fig [2l Thus we arrive at 



(cos6'(fc, rx)) 



7r(fc,ri) Jl^ dqqJo{r±y/(f^^)P3B{q) 



(22) 

where Jq is the cylindrical Bessel function of order zero. 
The numerator and denominator of the last equality in 
eqn. ([22l) follow from the definitions of the longitudinal 
and cross power for an isotropic 3D powe r spectrum (see 
e,£jLumsden qtalJil989; Peacock 1999- Hui et al.lll99l 
iViel et al. 2002). The denominator is the familiar expres- 
sion for the ID power expressed as a projection of the 3D 
power. Note that ID modes with wavenumber fc receive 
contributions from all 3D modes with wavevectors > fc, 
which results simply from the geometry of observing a 
3D field along a ID skewer. A long-wavelength (low- 
fc) ID longitudinal mode can be produced by a short- 
wavelength (high-fc) 3D mode dire cted nearly perp endic- 
ular to the line of sight (see e.g. iPeacoc g |l99l . The 
numerator of eqn. p2[) is similarly a projection over all 
high-fc 3D modes, but because of the non-zero separation 
of the skewers the 3D power spectrum is now modulated 
by the cylindrical Bessel function Jo{x). Because Jo{x) is 
highly oscillatory, the primary contribution to this pro- 
jection integral will come from arguments in the range 
< X < Xq. Here xq = 2.4048 is the first zero of Jo{x), 
which motivates our earlier definition of fcj_ = xo/aHr±. 
For larger arguments x, the decay of Jo{x) and its rapid 
oscillations will result in cancellation and negligible con- 
tributions. Thus for k/kj_ <C 1, we can finally write 



(cos 9{k, r±)) 



Ik dqqMr±VQ^^)PMq) 



JT dqqP3B{q) 



(23) 



This equation states that the average value of the phase 
difference between homologous fc modes is determined by 
the ratio of the 3D power integrated against a 'notch fil- 
ter' which transmits the range [fc, fcj,], relative to the to- 
tal integrated 3D power over the full range [fc, 00]. Hence 
phase angles between modes with wavelengths > 100 
times larger than Aj, are nevertheless sensitive to the 
amount of 3D power down to scales as small as the trans- 
verse separation r± . This results simply from the geom- 
etry of observing a 3D field along ID skewers, because 
the power in longitudinal mode fc is actually dominated 
by the superposition of 3D power from much smaller 
scales ^ fc. Provided that quasar pair separations re- 
solve the Jeans scale r± ~ Aj, even large scale modes 
with fc ^ fc_L ~ 1/Aj are sensitive to the shape of the 
3D power on small-scales, which explains the sensitivity 
of low-fc modes to the Jeans scale in Figure 01 

Finally, the form of eqn. p3)) combined with eqn. (|20p 
explains the basic qualitative trends in Figure |4l For 
large r± (small fc^) the projection integral in the numer- 
ator decreases, {cos9{k,r±)) approaches zero, indicat- 
ing that P(9{k,r±)) approaches uniformity. Similarly, as 
r± Aj, {cos6{k,r±)) grows indicating that P(6'(fc, rj^)) 
is peaked toward small phase angles, and in the limit 
r± ^ Aj {cos6{k,r±)) — >■ 1 and P{6{k,r±)) approaches 
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Fig. 5. — Same plot of figure [4] but for the Lya transmitted fiux field instead of density. We vary the Jeans scale Aj, keeping fixed 
the equation-of-state parameters, Tq = 10000 K and 7 = 1.6. The properties of the distributions are analogous to the previous plot, they 
follow with good approximation a wrapped-Cauchy profile and they exhibit the same trends with rx, k and Aj. Overall, the flux shows an 
higher degree of coherence and a slightly smaller sensitivity to Aj. 



a Dirac delta function. At fixed r^, lower k modes will 
result in more common pathlcngth in the projection in- 
tegrals in the numerator and denominator of eqn. ()23|) . 
thus {cos9{k,r±)) is larger, P{9{k,r±)) is more peaked, 
and the phase angles are more highly correlated. 

To summarize, following a simple heuristic argument, 
we derived a analytical form for the phase angle distri- 
bution in § 14.21 which is parametrized by a single num- 
ber, the concentration C- We verified that this simple 
parametrization provides a good fit to the distribution 
of phase differences in our simulated skewers, and ex- 
plored the dependence of this distribution on transverse 
separation rj_, wavenumber k, and the Jeans scale Aj. 
Phase differences between large-scale modes with small 
wavenumbers k <C 1/Aj, are sensitive to the Jeans scale, 
because geometry dictates that \ow-k cross-power across 
correlated ID skewers is actually dominated by high-/c 3D 
modes up to a scale set by the pair separation kj_ ^ 1/ 



4.4. The Probability Distribution of Phase Differences 
of the Flux 

Having established that the wrapped-Cauchy distribu- 
tion provides a good description of the phase difference 
of IGM density skewers, we now apply it to the Lya for- 
est flux. Figure [5] shows the PDF of phase differences 
for the exact same transverse separations r±, wavenum- 
bers k, and Jeans smoothings Aj that were shown in 
Figure m The other thermal parameters Tq and 7 have 
been set to (To,7) = (10, 000 K, 1.6). Overall, the be- 
havior of the phase angle PDF for the flux is extremely 
similar to that of the density, exhibiting the same ba- 
sic trends. Namely, the flux PDF also transitions from 
a strongly peaked distribution (r± < Aj) to a flat one 
{r± ^ Aj) at around r± ~ Aj. Lower fc-modes tend to 
be more highly correlated, and \ow-k modes correspond- 
ing to wavelengths > lOOAj are nevertheless very sensi- 
tive to the Jeans scale, in exact analogy with the density 
field. Note that because the 3D power spectrum of the 
flux field is now anisotropic, the assumptions leading to 
the derivation of eqn. [23] in the previous section breaks 
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Fig. 6. — Phase difference probability density functions for different separations r and wavenumbers k. All models have the same Jeans 
scale J = 140 kpc. For clarity we plot only the best-fit wrapped-Cauchy function without simulated points with errorbars. The black 
and the red lines are the phase angle PDFs for the transmitted flux of the Ly forest and the IGM density field, respectively. The green 
line represents the case of the Ly forest flux where peculiar velocities are set to zero. By comparing the green and the black lines we see 
that in peculiar motions always increase the coherence between the two sightlines, which partly explains the differences between the flux 
and density distributions, since the latter is calculated in real space. The flux and density further differ because of the non-linear FGPA 
transformation, which has a stronger effect on smaller scale modes. 

down for the flux. Nevertheless, the explanation for the 
sensitivity of low-fc modes to the Jeans scale is likely the 
same, namely the \ow-k power across correlated skewers 
is actually dominated by projected high-A: 3D power up 
to a scale k± ^ I /r± , which is set by the pair separation. 

The primary difference between the phase angle PDF 
of flux versus the density appears to be that the flux 
PDF is overall slightly less sensitive to the Jeans scale. 
In general, we do not expect the two distributions to be 
exactly the same for several reasons. First, the flux rep- 
resents a highly nonlinear transformation of the density: 
according to the FGPA formula 6F ^ exp [— (1 -I- (S)^] 
where /3 = 2 — 0.7(7—1). Second, the flux is observed 
in redshift space, and the peculiar velocities which deter- 
mine the mapping from real to redshift space, can further 
alter the flux relative to the density. Finally, the flux 
fleld is sensitive to other thermal parameters Tq and 7, 
both through the nonlinear FGPA transformation, and 
because of thermal broadening. In what follows, we in- 



vestigate each of these effects in turn, and discuss how 
each alters the phase angle PDF and its sensitivity to the 
Jeans scale. 

In Figure [6] we show the flux PDF (black) alongside 
the density PDF (red) for various modes and separa- 
tions, again with the thermal model fixed to (Tq, 7, Aj) — 
(20, 000 K, 1.0, 140) kpc. To isolate the impact of pecu- 
liar velocities, we also compute the phase angle PDF 
of the real-space flux, i.e. without peculiar velocities 
(green). Speciflcally, we disable peculiar velocities by 
computing the flux from eqn. ^ with Wp || set to zero. 
Overall, the PDFs of the real-space flux and density (also 
real-space) are quite similar. For low wavenumbers, the 
real-space flux skewers are always slightly more coher- 
ent than the density {P{0) more peaked) for all separa- 
tions. However, at the highest fc, the situation is reversed 
with the density being more coherent than the real-space 
flux. A detailed explanation of the relationship between 
the phase angle PDF of the real-space flux and the den- 
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Fig. 7. — Phase difference probability density functions for different separations r_i_, wavenumbers k and equation-of-state parameters 
To — 7. Points with errorbars (estimated Poisson error) are the results of our simulations, while the coloured lines are the best-likelihood fit 
using a wrapped-Cauchy distribution. All models have the same Jeans scale Aj = f 40 kpc. This plot shows the most remarkable property 
of phases: they do not exhibit any relevant sensitivity to the equation of state, so they robustly constrain the spatial coherence given by 
pressure support. 



sity fields requires a better understanding of the effect 
of tfie non-linear FGPA transformation on the 2-point 
function of the flux, which is beyond the scope of the 
present work. Here we only argue that the 3D power 
spectrum of the real-space flux has in general a different 
shape than that of the density, and using our intuition 
from eqn. , this will result in a different shape for the 
distribution of phase angles. The net effect of peculiar ve- 
locities on the redshift-space flux PDF is to increase the 
amount of coherence between the two sightlines {P{0) 
more peaked) relative to the real-space flux. This likely 
arises because the peculiar velocity field is dominated by 
large-scale power, which makes the 3D power of the flux 
steeper as a function of k. Again based on our intuition 
from eqn. (|22|) . a steeper power spectrum will tend to 
increase the coherence ((cos(6'(A:, rj^))) closer to unity), 
because the projection integrals in the numerator and 
denominator of eqn. (p2)) will both have larger relative 
contributions from the interval [fc, k±]. Note that the rel- 
ative change in the flux PDF due to peculiar velocities 
is comparable to the differences between the real-space 



flux and the density. At the highest fc-values where the 
real-space flux is less coherent than the density (lowest 
panel of Figure [6]) , peculiar velocities conspire to make 
the redshift-space flux PDF very close to the density 
PDF. 

Finally, we consider the impact of the other thermal 
parameters Tq and 7 on the distribution of phases in 
Figure [71 There we show the PDF of the phase an- 
gles for the flux for a fixed Jeans scale Aj — 140 kpc, 
and three different thermal models. Varying Tq and 
7 over the full expected range of these parameters has 
very little impact on the shape of the phase angle PDF, 
whereas we see in Figure [5] that varying the Jeans scale 
has a much more dramatic effect. The physical expla- 
nations for the insensitivity to Tq and 7 are straightfor- 
ward. The thermal parameters Tq and 7 can influence 
the phase angle PDF in two ways. First, the FGPA de- 
pends weakly on temperature T~^ '^ through the recom- 
bination coefficient. As a result the non-linear transfor- 
mation between density and flux depends weakly on 7 
SF ~ exp[-{l + Sf] where ^ = 2 - 0.7(7 - !)• We 



15 



speculate that the tiny differences between the thermal 
models in Figure [7] are primarily driven by this effect, 
because we saw already in Figure [7] that the non-linear 
transformation can give rise to large differences between 
the density and flux PDFs. This small variation of the 
PDF with 7 then suggests that it is actually the expo- 
nentiation which dominates the differences between the 
flux and density PDFs in Figure [3 with the weaker 7 
dependent transformation (1 + (5)2-0-7(7-1) playing only 
a minor role, which is perhaps not surprising. Note that 
there is also a Tq""'^ dependence in the coefficient of the 
FGPA optical depth, but as we require all models to have 
the same mean flux (exp(— t)), this dependence is com- 
pensated by the freedom to vary the metagalactic pho- 
toionization rate F. Second, both Tq and 7 determine the 
temperature of gas at densities probed by the Lya forest, 
which changes the amount of thermal broadening. The 
insensitivity to thermal broadening is also rather easy to 
understand. Thermal broadening is effectively a convo- 
lution of the flux field with a Gaussian smoothing kernel. 
In fc-space this is simply a multiplication of the Fourier 
transform of the flux SF(k) with the Fourier transform 
of the kernel. Because all symmetric kernels will have 
a vanishing imaginary parfl the convolution can only 
modify the moduli of the flux but the phases are invari- 
ant. Thus the phase differences between neighboring flux 
skewers are also invariant to smoothing, which explains 
the insensitivity of the flux phase angle PDF to thermal 
broadening, and hence the parameters Tq and 7. 

The results of this section constitute the cornerstones 
of our method for measuring the Jeans scale. We found 
that the phase angle PDF of the flux has a shape very 
similar to that of the density, and that both are well 
described by the single parameter wrapped- Cauchy dis- 
tribution. Information about the 3D smoothing of the 
density field Aj, is encoded in the phase angle PDF of the 
flux, but it is essentially independent of the other ther- 
mal parameters governing the IGM. This results because 
1) the non-linear FGPA transformation is only weakly 
dependent on temperature 2) phase angles are invariant 
under symmetric convolutions. The implication is that 
close quasar pair spectra can be used to pinpoint the 
Jeans scale without suffering from any significant degen- 
eracies with To and 7. Indeed, in the next section we 
introduce a Bayesian formalism for estimating the Jeans 
scale, and our MCMC analysis in § |6] will assess the ac- 
curacy with which the thermal parameters can be mea- 
sured, and explicitly demonstrate the near independence 
of constraints on Aj from Tq and 7. 

5. ESTIMATING THE JEANS SCALE 

5.1. The Covariance of the Phase Differences 

In the previous section, we showed that the PDF 
of phase differences between homologous longitudinal 
modes of the flux fleld are well described by the wrapped- 
Cauchy distribution (see eqn. 1191) . However, the one- 
point function alone is insufficient for characterizing the 
statistical properties of the stochastic field d{k,r±), be- 
cause in principle values of 9 closely separate in either 

® The imaginary part of the Fourier transform of the symmetric 
function is Q[VF{A;)] = f W {\x\) sm(kx)dx which is always 

odd and will integrate to zero. 



wavenumber k or real-space could be correlated. Un- 
derstanding the size of these two-point correlations is of 
utmost importance. Any given quasar pair spectrum pro- 
vides us with a realization of 9{k,r±), and we have seen 
that the distribution of these values depends sensitively 
on the Jeans scale Aj. In order to devise an estimator for 
the thermal parameters in terms of the phase differences, 
we have to understand the degree to which the 9{k,r±) 
are independent. 

It is easy to rule out the possibility of spatial correla- 
tions among the 9 values deduced from distinct quasar 
pairs. Because quasar pairs are extremely rare on the sky, 
the individual quasar pairs in any observed sample will 
typically be ~ Gpc away from each other, and hence dif- 
ferent pairs will never probe correlated small-scale den- 
sity fiuctuations. However, the situation is much less 
obvious when it comes to correlations between 9 values 
for different fc-modes of the same quasar pair. In partic- 
ular, nonlinear structure formation evolution will result 
in mode-mode coupling, which can induc e correlations 
between mode am plitud es and phas es (e.g. iChiang et al.l 
[200I iWatts et al.ll200l [Colesll200i ). We are thus moti- 
vated to use our simulated skewers to directly quantity 
the size of the correlations between phase differences of 
distinct longitudinal fc-modes. 

We calculate the correlation coefficient matrix of 9 be- 
tween modes k and k' defined as 



Ce{k,k';r^) = 



{9{k,r^)9{k',r^)) 
^{9^k,r^)){9^k',r^))' 



(24) 



Our standard setup of 330 pairs at each discrete separa- 
tion r± results in a very noisy estimate of Ce{k, k'; r±), 
so we proceed by defining a new set of 80,000 skewers at 
two distinct discrete transverse separations of rj_ = 70 
kpc and rj_ = 430 kpc for a single thermal model with 
(To,7,Aj) = (20, 000 K, 1,143 kpc). 

Figure [5] displays the correlation coefficient matrix for 
the two separations r± that we simulated. We find that 
the off-diagonal correlations between fc-modes are high- 
est at high k values and for smaller impact parameters. 
This is the expected behavior, since higher longitudinal 
fc-modes will have a larger relative contributions from 
higher-fc 3D modes, which will be more non-linear and 
have larger mode-mode correlations. Likewise, as per the 
discussion in § 14.31 phase differences at smaller pair sep- 
arations rj_ are sensitive to higher k 3D power ~ fc^, 
and should similarly exhibit larger correlations between 
modes. Note however that over the range of longitudinal 
fc values which we will use to constrain the Jeans scale 
0.005 < k < 0.1, the size of the off-diagonal elements are 
always very small, of the order of ~ 1 — 3%. 

The small values of the off-diagonal elements indicates 
that the mode-mode coupling resulting from non-linear 
evolution does not result in significant correlations be- 
tween the phase angles of longitudinal modes. This could 
result from the fact that the intrinsic phase correlations 
of the 3D modes is small, and it is also possible that 
the projection of power inherent to observing along ID 
skewers (see § 14. 3p dilutes these intrinsic phase correla- 
tions, because a given longitudinal mode is actually the 
average over a large range of 3D modes. From a prac- 
tical perspective, the negligible off-diagonal elements in 
Figure [8] are key, because they allow us to consider each 
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Fig. 8. — Logarithm of the phase k — k correlation for separations rj_ = 70 kpc (loft) and rj_ = 430 kpc (right). This matrices are 
calculated for a model with Aj = 143 kpc, Tq = 20000 K and 7 = 1. Phases are more correlated when the impact parameter is smaller than 
the jeans scale and at high k where nonlinear growth of perturbations couples different modes. Even in this cases we rarely find correlations 
higher than 3%, for which reason we will work in the diagonal approximation. This approximation may break out if the measured Jeans 
scale will be significantly larger than expected. 

phase difference 9{k,r±) as an independent random draw 
from the probabihty distributions we explored in § 14.41 
which as we show in the next section, dramaticaUy sim- 
phfies the estimator that we will use to determine the 
Jeans scale. 



about any thermal parameter, for example for Aj 

J^{{0}\Xj)p{\j) 



(26) 



5.2. A Likelihood Estimator for the Jeans Scale 

The results from the previous sections suggest a simple 
method for determining Jeans scale. Namely, given any 
quasar pair, the phase angle difference for a given k-mode 
represents a draw from the underlying phase angle PDF 
determined by the thermal properties of the IGM (as 
well as other parameters governing e.g. cosmology and 
the dark matter which we assume to be fixed). In ii l4.4l we 
showed that the phase angle PDF is well described by the 
wrapped-Cauchy distribution and in S IS.ll we argued that 
correlations between phase angle differences 9{k,r±), in 
both fc-space and real-space can be neglected. Thus for 
a hypothetical dataset 0{k,r±) measured from a sample 
of quasar pairs, we can write that the likelihood of the 
thermal model M = {Tq, 7, Aj} given the data is 

^m\M) =l[Pwc{Oih,r,)\ak,r^\M)). (25) 



This states that the likelihood of the data is the product 
of the phase angle PDF evaluated at the measured phase 
differences for all fc-modes and over all quasar pair sep- 
arations r±. Note that the simplicity of this estimator 
is a direct consequence of the fact that there are negli- 
gible 9 correlations between different /c-modes and pair 
separations. All dependence on (ro,7, Aj) is encoded in 
the single parameter C, which is the concentration of the 
wrapped-Cauchy distribution fegn. [T9t . 
We can then apply Bayes' theorem to make inferences 



where p{Xj) is our prior on the Jeans scale and the de- 
nominator acts as a renormalization factor which is im- 
plicitly calculated by a Monte Carlo simulation over the 
parameter space. The same procedure can be used to 
evaluate the probability distribution of the other param- 
eters. Throughout this paper, we assume flat priors on 
all thermal parameters, over the full domain of physically 
plausible parameter values. 

In § [S] we will use MCMC techniques to numerically 
explore the likelihood in eqn. ([26l) and deduce the pos- 
terior distributions of the thermal parameters. In order 
to do this, we need to be able to evaluate the function 
({k,r±\To,j, Xj) at any location in thermal parameter 
space. This is a non-trivial computational issue, because 
we do not have a closed form analytical expression for C. 
which can be evaluated quickly, and thus have to resort to 
our cosmological simulations of the IGM to numerically 
determine it for each model, as described in Appendix [Bl 
In practice, computational constraints limit the size of 
our thermal parameter grid to only 500 thermal models, 
and we thus evaluate C at only these 500 fixed locations. 
In the next section, we describe a fast procedure referred 
to as an emulator, which allows us to interpolate C from 
these 500 locations in our finite thermal parameter grid, 
onto any value in thermal parameter space (Tq, 7, Aj). 

5.3. Emulating the IGM 

Our goal is to define an algorithm to calculate 
C(fc, fx|To, 7, A,/) as a function of the thermal param- 
eters, interpolating from the values determined on a 
fixed grid. As we will also compare Jeans scale con- 
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straints from the phase angle PDF (eqn. 1^51) . to those 
obtained from other statistics, such as the longitudinal 
power P{k) and cross-power 7r(fc,rj_) (see §|6]), we also 
need to be able to smoothly interpolate these functions 
as well. To achieve this, we follow the approach of the 
'Cosmic Calibration Framework' (CCF) to provide an 
accurate prediction scheme for cosmologic al observables 
(jHeitmann et alj 120061: iHabib et al.]|2007[ ). The aim of 
the CCF is to build emulators which act as very fast 
- essentially instantaneous - prediction tools for large 
scale struc ture observables such a s the nonlinear power 
spect rum (jHeitmann et al.l I2009L 120101 : iLawrence et al 



201C )■ or the concentration-mass relation ()Kwan et al 



20121) . Three essential steps form the basis of emulation. 



First, one devises a sophisticated space-filling sampling 
scheme that provides an optimal sampling strategy for 
the cosmological parameter space being studied. Sec- 
ond, a principle component analysis (PCA) is conducted 
on the measurements from the simulations to compress 
the data onto a minimal set of basis functions that can 
be easily interpolated. Finally, Gaussian process mod- 
eling is used to interpolate these basis functions from 
the locations of the space filling grid onto any value in 
parameter space. A detailed description of our IGM em- 
ulator will be described in a companion paper (A.Rorai 
et al. 2013, in preparation). Below we briefly summarize 
the key aspects. 

Whereas CCF uses more sophistica ted space filling 
Latin Hypercube sampling schemes (e.g. Hcitman n et al.l 
l2009f ). we adopt a simpler approach motivated by the 
shape of the IGM statistics we are trying to emulate, 
which change rapidly at scales comparable to either the 
Jeans or thermal smoothing scale. We opt for an ir- 
regular scattered grid which fills subspaces more effec- 
tively than a cubic lattice. We consider parameter values 
over the domain {(To,7,Aj) : Tq € [5000, 40000] K; 7 e 
[0.5,2]; Xj G [43,572]kpc}. The lower limit of 43 kpc 
for the Jeans scale is chosen because this is about the 
smallest value we can resolve with our simulation (see 
Appendix E]) . while the upper limit of 572 kpc is a con- 
servative constraint deduced from the longitudinal power 
spectrum: a filtering scale greater than this value would 
be inconsistent with the high— fc cutoff, regardless of the 
value of the temperature. The ranges considered for Tq 
and 7 are consistent with those typically considered in 
the literature and our expectations based on the physics 
governing the IGM. We sample the 3D thermal parame- 
ter space at 500 locations, where we consider a discrete 
set of 50 points in each dimension. A linear spacing of 
these points is adopted for 7, whereas we find it more ap- 
propriate to distribute Tq and Aj such that the scale of 
the cutoff of the power spectrum fe/ is regularly spaced. 



Since k 



f 



for Jeans smoothing and k 



-1-1/2 



for thermal broadening, we choose regular intervals of 
these parameters after transforming Aj — > 1/Aj and 
To I/a/Tq. Each of the 50 values of the parameters is 
then repeated exactly 10 times in the 500-point grid, and 
we use 10 different random permutations of their indices 
to fill the space and to avoid repetition. For each thermal 
model in this grid, we generate 10,000 pairs of skewers 
at 30 linearly spaced discrete pair separations between 
and 714 kpc. 

We then use these skewers to compute the IGM statis- 



tics C{k,r±), P{k), and Tr{k,r±) for all k and r± for 
each thermal model. A PCA decomposition is then per- 
formed in order to compress the information present in 
each statistic and represent its variation with the thermal 
parameters using a handful of basis functions 0. A PCA 
is an orthogonal transformation that converts a family 
of correlated variables into a set of linearly uncorrelated 
combinations of principal components. The components 
are ordered by the variance along each basis dimension, 
thus relatively few of them are sufficient to describe the 
entire variation of a function in the space of interest, 
which is here the thermal parameter space. To provide a 
concrete example, the longitudinal power spectrum P{k) 
is fully described by the values of the power in each k 
bin, but it is likely that some of these P{k) values do not 
change significantly given certain combinations of ther- 
mal parameters. The PCA determines basis functions 
of the P{k) that best describe its variation with ther- 
mal parameters, enabling us to represent this complex 
dependence with an expansion onto just a few principal 
components 



P{k\To,j,Xj 



w,(ro,7,Aj)$,(fc), (27) 



where {$(fc)} are the basis of principal components, and 
{uj} are the corresponding coefficients which depend on 
the thermal parameters. The number of components for 
a given function is set by the maximum tolerable inter- 
polation errors of the emulator, and these are in turn set 
by the size of the error bars on the statistic that one is 
attempting to model. We defer a detailed discussion of 
the PCA analysis and the procedure used to determine 
the number of components to an upcoming paper (Ro- 
rai et al. 2013, in prep), but we note that the number of 
PCA components we used to fully represent the functions 
({k,r±), P{k), and 7r(fc,rj_) were 25, 15, and 25, respec- 
tively (phase distribution and cross power spectrum are 
2D functions, so they need more components). 

Gaussian process interpolation is then used to interpo- 
late these PCA coefficients Wi(To, 7, Aj) from the irregu- 
lar distribution of points in our thermal grid to any loca- 
tion of interest in the parameter space. The only input 
for the Gaussian interpolation is the choice of smoothing 
length, which quantifies the degree of smoothness of each 
function along the direction of a given parameter in the 
space. We choose these smoothing lengths to be a mul- 
tiple of the spacing of our parameter grid. The choice of 
these smoothing lengths is somewhat arbitrary, but we 
checked that the posterior distributions of thermal pa- 
rameters (eqn. I26p inferred do not change in response to 
a reasonable variations of these smoothing lengths. A 
full description of the calibration and testing of the em- 
ulator is presented in an upcoming paper (Rorai et al. 
2013, in prep). 

To summarize, our method for measuring the Jeans 
scale of the IGM involves the following steps: 

• Calculate the phase differences 9{k,r±) for each k- 
mode of an observed sample of quasar pairs with 
separations r±. 

• Generate Lya forest quasar pair spectra for a 
grid of thermal models in the parameter space 
(To,7, Aj), using our IGM simulation framework. 
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For each model, numerically determine the con- 
centration parameter (^{k,r±\TQ,j, Xj) at each 
wavenumber k and separation r±, from the distri- 
bution of phase differences 6{k,r±). 

• Emulate the function jTo, 7, Aj), enabling 
fast interpolation of C from the fixed values in the 
thermal parameter grid to any location in thermal 
parameter space. 

• Calculate the posterior distribution in eqn. ([^ for 
A J, by exploring the likelihood function in eqn. (j25l) 
with an MCMC algorithm. 

6. HOW WELL CAN WE MEASURE THE JEANS SCALE? 

Our goal in this section is to determine the precision 
with which close quasar pair spectra can be used to mea- 
sure the Jeans scale. To this end, we construct a mock 
quasar pair dataset from our IGM simulations and ap- 
ply our new phase angle PDF likelihood formalism to 
it. A key question is how well constraints from our new 
phase angle technique compare to those obtainable from 
alternative measures, such as the cross-power spectrum, 
applied to the same pair sample, or from the longitudi- 
nal power spectrum, measured from samples of individ- 
ual quasars. In what follows, we first present the like- 
lihood used to determine thermal parameter constraints 
for these two additional statistics. Then we describe the 
specific assumptions made for the mock data. Next we 
quantify the resulting precision on the Jeans scale, ex- 
plore degeneracies with other thermal parameters, and 
compare to constraints from these two alternative statis- 
tics. We explore the impact of finite signal-to-noise ratio 
and spectral resolution on our measurement accuracy, 
and discuss possible sources of systematic error. Finally, 
we explicitly demonstrate that our likelihood estimator 
provides unbiased determinations of the Jeans scale. 

6.1. The Likelihood for P{k) and Tr{k,rj_) 

For the longitudinal power P{k), we assume that the 
distribution of differences, between the measured band 
powers of a fc-bin and the true value, is a multi-variate 
Gaussian (see e.g. iMcDonald et aLll2006[ ). which leads to 
the standard likelihood for the power-spectrum 



^(Pd|A/) = (27r)-^/2det(I]) 



'1/2 



(28) 



exp 



^-'{Pd-P. 



where Pd is a vector of N observed ID band powers, 
Pm is a vector of power spectrum predictions for a given 
thermal model M = (Tq, 7, Aj), and 

I](fc, k') - ([P(fc) - PM{k)][P{k') ~ PM{k')]), (29) 

is the full covariance matrix of the power spectrum mea- 
surement. As we describe in the next subsection, we will 
choose a subset of the skewers from a fiducial thermal 
model to represent the 'data' in this expression, which 
are then compared directly to thermal models (Tq, 7, Aj), 
where the same emulator technique described in § 15.31 is 
used to interpolate PM(fc|Tb, 7j Aj) to parameter loca- 
tions in the thermal space. To determine the covari- 
ance of this mock data T,{k,k'), we use the full ensem- 
ble of 2 X 10, 000 ID skewers for the fiducial thermal 



model, directly evaluate the covariance matrix, and then 
rescale it to the size of our mock dataset by multiply- 
ing by the ratio of the diagonal terms '''^a^aset/'^^fuir 
This procedure of evaluating the covariance implicitly as- 
sumes that the only source of noise in the measurement is 
sample variance, or that the instrument noise is negligi- 
ble. For the high-resolution and high signal-to-noise ra- 
tio spectra used to measure the longitudinal power spec- 
trum cutoff (McDonald et al. 2000; Croft ct al. 2002), 
this is a reasonable assumption. For reference, the rel- 
ative magnitude of off-diagonal terms of the covariance, 
j:{k,k')/y/Y.{k,k)j:{k',k'), are at most 20 - 30% with 
the largest values attained at the highest k. 

For the cross-power spectrum 7r(fc,rj^), we follow the 
same procedure. Namely, a mock dataset is constructed 
for the fiducial thermal model by taking a subset of the 
full ensemble of quasar pair spectra. We again assume 
that the band power errors are distributed according to 
a multi-variate Gaussian, but because we must now ac- 
count for the variation with separation r±, the corre- 
sponding likelihood is 



^(^|M) = J|^(7rrf(fc,rj.,,)|M), 



(30) 



where ^{TTd{k,r±^i)\M) has the same form as the lon- 
gitudinal power in eqn. ()29|) . In exact analogy with the 
longitudinal power, we compute the full covariance ma- 
trix S(fc, k'\r±) of the cross-power using our full ensemble 
of simulated pair spectra for our fiducial model, but now 
at each value of r j_ . 

6.2. Mock Datasets 

To determine the accuracy with which we can measure 
the Jeans scale and study the degeneracies with other 
thermal parameters, we construct a dataset with a real- 
istic size and impact parameter distribution, and use an 
MCMC simulation to explore the phase angle likelihood 
in eqn. ((25|) . We compare these constraints to those ob- 
tained from the cross-power spectrum for the same mock 
pair dataset, by similarly using an MCMC to explore the 
cross-power likelihood in eqn. (|30|) . We also compare to 
parameter constraints obtainable from the longitudinal 
power alone, by exploring the likelihood in eqn. (j29|) . for 
which we must also construct a mock dataset for longi- 
tudinal power measurements. 

For the mock quasar pair sample, we assume 20 
quasar pair spectra at z — 3, with fully overlapping 
absorption pathlength between Lya and Ly/3. Any real 
quasar pair sample will be composed of both binary 
quasars with full overlap and projected quasar pairs 
with partial overlap, so in reality 20 represents the total 
effective pair sample, whereas the actual number of 
quasar pairs required could be larger. The distribution 
of transverse separations for these pairs is taken to be 
uniform in the range 24 < r± < 714 kpc. Specifically, 
we require 200 pairs of skewers in order to build up 
the necessary path length for 20 full Lya forests, and 
these are randomly selected from our 10,000 IGM pair 
skewers which have 30 discrete separations. We draw 
these pairs from a simulation with a fiducial thermal 
model (To,7,Aj) = (12, 000 K, 1.0, 110, kpc), which lies 
in the middle of our thermal parameter grid. Note that 
follow-up observations of quasar pair candidates has 
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resulted in samples of > 400 quasar pairs in the range 
1.6 < z < 4.3 with < 700 kpc, and for those with 
> 50% overlap, the total effective number of full y over- 
lapping pairs is c± 300 ( Hennawi 2004; Hcnnaw i et al.l 
l2006btlMrers et al.l [200l IHennawi et al.l 12009 ) . Many 
of these sightlines already have the high quality Lya 
fores t spectra required for a Jeans scale measurement 
e^g. IHen nawi et al.|[2006at IHennawi fc Prochaskall2007l: 



Prochask a k, Hennawil 20091 : IHennawi fc Prochaskal 



20081 : IProchaska et al.Tl20lir hence the mock dataset we 



have assumed already exists, and can be easily enlarged 
given the number of close quasar pairs known. 

Longitudinal power spectrum measurements which 
probe the small-scale cutoff of the power have been 
performed on high-resolution {R ~ 30, 000 — 50, 000; 
FWHM=6 — 10 km s~ ) spectra of the brightest quasars. 
Typically, the range of wavenumbers used for model 
fitting is 0.005s km~"^ < fc < 0.1 skm"^ (see Fig- 
ure [2]), where the low-fc cutoff is chosen to avoid sys - 
tematics related to quasar continuum fitting (!Leell2012l ). 
and the high-fc cutoff is adopted to mitigate contam- 
ination from metal absorption lines (|McDonald et al.l 
l2000tlCroft eTaI1l200llKim et al.ll2004D . Because quasar 
pairs are very rare, one must push to faint magni- 
tudes to find them in sufficient numbers. In contrast 
with the much brighter quasar s used to measure the 
small-scale longitudinal power ( McDonald et al.l 120001 : 
ICroft et al.l[20?)2l : IKim et alMm . quasar pairs are typ- 
ically too faint to be observable at echelle resolution 
(FWHM=6 — 10 km s~^) on 8m class telescopes. How- 
ever, quasar pairs can be observed with higher efficiency 
echellette spectrometers, which deliver R ~ 10, 000 or 
FWHM= 30 km s"^ The cutoff in the power spec- 
trum induced by this lower resolution is fcrcs = 1 / Crcs = 
2.358/FWHM = 0.08 s km"\ which is very close to the 
upper limit k < 0.1s km""'^ set by metal-line contami- 
nation. For these reasons, we will consider only modes 
in the range 0.005 s km~^ < k < 0.1s km~^ in the like- 
lihood in eqn. (|25l). We initially consider perfect data, 
ignoring the effect of finite signal-to-noise rate and res- 
olution. Then in § 16.41 we will explore how noise and 
limited spectral resolution influence our conclusions. 

For the mock sample used to study the longitudi- 
nal power, we assume perfect data, which is reasonable 
considering that such analyses are typically performed 
on spectra with signal-to- noise ratio S/N ~ 30 and 
resolution F WH M= 6 km s~^ (iMcDonald et all 120001: 
ICroft et al.l 120021: iKim et al.l |200j) such that the Lya 
forest, and in particular modes with k < 0.1, are fully 
resolved. For the size of this sample, we again assume 20 
individual spectra at z = 3 with full coverage of the Lya 
forest, which is abou t twice the size employed in recently 
published analyses (IMcDonald et all 120001 : ICroft et all 
120021: IKim et all I2004D . However, the number of exist- 
ing archival high-resolution quasar spectra at z = 3 eas- 
ily exceeds this number, so samples of this size are also 
well within reach. Also, adopting a sample for the lon- 
gitudinal power with the same Lya forest path length 
as the quasar pair sample, facilitates a straightforward 
comparison of the two sets of parameter constraints. 

6.3. The Precision of the Xj Measurement 



Given our mock dataset and the expression for the 
phase angle likelihood in eqn. ((25|) . and armed with 
our IGM emulator, which enables us to quickly evaluate 
this likelihood everywhere inside our thermal parameter 
space, we are now ready to explore this likelihood with an 
MCMC simulation to determine the precision with which 
we can measure the Jeans scale and explore degeneracies 
with other thermal parameters. 

We emp loy the publicly availabl e MCM C package de- 
scribed in iForeman-Mackev et all (|2012[ ) , which is par- 
ticularly well adapted to explore parameter degeneracy 
directions. The result of our MCMC simulation is the full 
posterior distribution in the 3-dimensional Tq — 7 — Aj 
space for each likelihood that we consider. It is impor- 
tant to point out that, in general, these posterior distri- 
butions will not be exactly centered on the true fiducial 
thermal model (ro,7,Aj) = 12, 000 K, 1, 110, kpc. In- 
deed, the expectation is that the mean or mode of the 
posterior distribution for a given parameter will scatter 
around the true fiducial value at a level comparable to 
the width of this distribution. Nevertheless, the poste- 
rior distribution should provide an accurate assessment 
of the precision with which parameters can be measured 
and the degeneracy directions in the parameter space. 
In § 16.61 we will demonstrate that our phase angle PDF 
likelihood procedure is indeed an unbiased estimator of 
the Jeans scale, by applying this method to a large en- 
semble of mock datasets, and showing that on average, 
we recover the input fiducial Jeans scale. 

The red shaded regions in Figure |9| show the con- 
straints in thermal parameter space resulting from 
our MCMC exploration of the phase angle likelihood 
(eqn. [25]) . The results are shown projected onto the 
To — Xj and 7 — Aj planes, where the third parameter (7 
and Tq, respectively) has been marginalized over. The 
dark and light shaded regions show 65% and 96% con- 
fidence levels, respectively. The phase difference tech- 
nique (red) yields essentially horizontal contours, which 
pinpoint the value of the Jeans scale, with minimal de- 
generacy with other thermal parameters. This is a direct 
consequence of the near independence of the phase an- 
gle PDF of Tq and 7 shown in Figure |6l and discussed 
in § 14.41 The physical explanation for this independence 
is that 1) the non- linear FGPA transformation in only 
weakly dependent on temperature 2) phase angles are 
invariant to the thermal broadening convolution. This 
truly remarkable result is the key finding of this work: 
phase angles of the Lya forest flux provide direct con- 
straints on the 3D smoothing of the IGM density inde- 
pendent of the other thermal parameters governing the 
IGM. 

The blue shaded regions in Figure |9| show the corre- 
sponding parameter constraints for our MCMC of the 
longitudinal power spectrum likelihood (eqn. |29|). Con- 
sidering the longitudinal power spectrum alone, we find 
that significant degeneracies exist between Aj, Tq and 
7, which confirms our qualitative discussion of these de- 
generacies in § 13.11 and illustrated in Figure |2| These 
degeneracy directions are easy to understand. The lon- 
gitudinal power is mostly sensitive to thermal parame- 
ters via the location of the sharp small-scale cutoff in the 
power spectrum. This thermal cutoff is set by a combina- 
tion of both 3D Jeans pressure smoothing and ID ther- 
mal broadening. The thermal broadening component is 
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Fig. 9. — Constraints on the 7 — Aj and To — Aj planes. The contours show the estimated 65% and 96% confidence levels obtained 
with the longitudinal power (blue) and the phase difference (red). The white dot marks the fiducial model in the parameter space. The 
degeneracy affecting the ID power already shown in figure [2] can now be seen clearly in the parameter space through the inclination of the 
black contours. Conversely, the fact that constraints given by the phase difference statistic are horizontal guarantees that this degeneracy 
is broken and that the measurement of the Jeans scale is not biased by the uncertainties on the equation of state. 




Fig. 10. — Constraints on the 7 — Aj and To — Aj planes. The contours show the estimated 65% and 96% confidence levels obtained with 
the longitudinal power (blue) and the cross power (green). The white dot marks the fiducial model in the parameter space. Comparing 
this plot with figure |9] makes clear why the cross power spectrum is not the optimal statistic for measuring Aj since the phase information 
is diluted and the degeneracy is not efficiently broken. 



set by the temperature of the IGM at the characteristic 
overdensity pr obed by the forest, which is 5 ~ 2 at = 3 
([Becker et aLll2(Ii ii) . One naturally expects a degeneracy 
between Tq and 7, because it is actually the tempera- 
ture at T{S « 2) that sets the thermal broadening. A 
degeneracy between Xj and T{6 w 2) is also expected 
because both smoothings contribute to the small-scale 
cutoff. Thus, a lower Jeans scale can be compensated by 
more thermal broadening, which can result from either a 
steeper temperature density relation (larger 7) or a hot- 
ter temperature at mean density To, since both produce 



a hotter T{6 « 2). 

Previous work that has aimed to measure thermal 
parameters such a s Tq and 7, from the lo ngitudinal 
powe r spectrum (jZaldarriaga et al.] 120011: IViel et al.l 
I2009D . the curvature s ta tistic (IBecker et al.l I2011D . 

(iTheuns et al.l l2002bl: llTdz et al.l 120091: 
2012"), and th e ^-parameter distri- 



wavelets 
iGarzilli et al. 

butio n ^(Hachnelt & Stcinm etz) [1 998; Theuns et alj 

■2000;5 "Ricotti et al. 2000; Brvan & Machacck 200fl 
i Schave et al, .2000; McDonald et al..,2001; , Theuns et atl 
l2002aHRudie et al.ll2012t) . have for the most part ignored 
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the degeneracies between these thermal parameters and 
the Jeans scale (but see Zaldarriaga et al. 2001 who 
marginahzed over the Jeans scale, and Becker et aL 2011 
who also considered its impact). Neglecting the possible 
variation of the Jeans scale is equivalent to severely 
restricting the family of possible IGM thermal histories. 
Because the phase angle method accurately pinpoints 
the Jeans scale independent of the other parameters, 
it breaks the degeneracies inherent to the longitudinal 
power spectrum and will enable accurate and unbiased 
measurements of both Tq and 7, as evidence by the 
intersection of the red and black contours in Figure [9l 
Similar degeneracies between the Jeans scale and {Tq,j) 
exist when one considers other sta ti stics such as the 
flux PDF (IMcDonald et all [2OOOI: IKim et all [20071: 



Bolton et al.1 l2008t iCalura et al.l l2012t iGarzilh et al l 
201211 . which we will explore in an upcoming study 
(Rorai et al. 2013, in prep). In light of these significant 
degeneracies with the Jeans scale, it may be necessary 
to reassess the reliability and statistical significance of 
previous measurements of To and 7. 

Figure [TOl shows the resulting thermal parameter con- 
straints for our MCMC analysis of the cross-power spec- 
trum likelihood (eqn. I30p in green, determined from ex- 
actly the same mock quasar pair sample that we ana- 
lyzed for the phase angles. The confidence regions for 
the longitudinal power are shown for comparison in blue. 
The cross-power spectrum is a straightforward statistic 
to measure and fit models to, and the green confidence 
regions clearly illustrate that it does exhibit some sensi- 
tivity to the Jeans scale, as discussed in § 13.21 and shown 
in the right panel of Figure [21 However, a comparison of 
the cross-power confidence regions in Figure [T0| (green) 
with the phase angle PDF confidence regions in Figure [1| 
(red) reveals that there is far more information about the 
Jeans scale in quasar pair spectra than can be measured 
with the cross-power. The cross-power produces con- 
straints which are effectively a hybrid between the hori- 
zontal Jeans scale contours for the phase angle distribu- 
tion and the diagonal banana shaped contours produced 
by the longitudinal power, which refiects the degeneracy 
between Jeans smoothing and thermal broadening. This 
quantitatively confirms our argument in § 14. 1[ that the 
cross-power is a product of moduli, containing informa- 
tion about the ID power, and the cosine of the phase, 
which depends on the 3D power. 

The results of this section indicate that among the 
statistics that we have considered, the phase angle PDF 
is the most powerful for constraining the IGM pressure 
smoothing, because it is more sensitive to the Jeans scale 
and results in constraints that are free of degeneracies 
with other thermal parameters. We demonstrate this 
explicitly in Figure [Tl] where we show the fully marginal- 
ized posterior distribution (see eqn. [26l) of the Jeans scale 
for each the statistics we have considered. The probabil- 
ity distributions quantify the visual impression from the 
contours in Figures [9| and [TOl and clearly indicate that 
the phase angle PDF is the most sensitive. The rela- 
tive error on the Jeans scale cr\/Xj = 3.9%, which is a 
remarkable precision when compared to the typical pre- 
cision ~ 30% of measurements of Tq a nd 7 in the pub- 
lished literature (see e.g. Figure 30 in lLidz et all[200l 
for a recent compilation), especially when one considers 
that only 20 quasar pair spectra are required to achieve 
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Fig. 11. — Estimated accuracy on the measurement of Aj, 
obtained marginalizing over To and 7 the posterior distribution 
from the MCMC analysis. The phase difference statistic (red) sets 
tighter constraints than the cross power (blue) and the longitudi- 
nal power (black), which are affected by parameter degeneracies. 
In this case we do not account for the effect of noise and limited 
resolution, and we find a relative precision of 3.9% for Aj. 

this accuracy. 

We close this section with a caveat to our statements 
that our Jeans scale constraints are free of degeneracies 
with other thermal parameters. The phase angle PDF is 
explicitly nearly independent of the temperature-density 
relation because 1) the non-linear FGPA transformation 
is only weakly dependent on temperature and 2) the 
phase angle PDF is invariant to the thermal broaden- 
ing convolution (sec § 14. 4p . However, in our idealized 
dark-matter only simulations, the Jeans scale is taken to 
be completely independent of To and 7; whereas, in real- 
ity all three parameters are correlated by the underlying 
thermal history of the Universe. In this regard, the Jeans 
scale may implicitly depend on the Tq and 7 at the red- 
shift of the sample, as well as with their values at earlier 
times. We argued that because the thermal history is 
not known, taking the Jeans scale to be free parameter 
is reasonable. However, the validity of this assumption 
and the implicit dependence of the Jeans scale on other 
thermal parameters is clearly something that should be 
explored in the future with hydrodynamical simulations. 

6.4. The Impact of Noise and Finite Spectral Resolution 

Up until this point we have assumed perfect data with 
infinite signal-to-noise ratio and resolution. This is unre- 
alistic, especially considering, as discussed in § 16.21 that 
that close quasar pairs are faint, and typically cannot be 
observed at echelle resolution or very high signal-to-noise 
ratio > 20, even with 8m class telescopes. In this section 
we explore the impact of noise and finite resolution on 
the precision with which we can measure the Jeans scale. 

We consider the exact same sample of 20 mock 
quasar spectra, but now assume that they are ob- 
served with spectral resolution corresponding to FWHM 
= 30 km s~^, and two different signal-to-noise ratios of 
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Fig. 12. — The effect of noise and resolution in the measurement 
of A J. The plots shows the posterior distribution of the Jeans scale, 
marginalized over To and 7. Each line represent a different degree 
of noise, assuming a resolution of FWHM=30 km/s. We selected 
a different subsample of the simulation as our mocfc dataset which 
has a precision of 3.6% for S/N=oo (black solid), 4.8% for S/N=10 
(green dot-dashed) and 7.2% for S/N=5 (red dashed). 

S/N ~ 5 and S/N ~ 10 per pixel. These values are 
consistent with what could be achieved using an echel- 
lette spectrometer on an 8m class telescope. To create 
mock observed spectra with these properties, we first 
smooth our simulated spectra with a Gaussian kernel 
to model the limited spectral resolution, and interpo- 
late these smoothed spectra onto a coarser spectral grid 
which has 10 km pixels, consistent with the spectral 
pixel scale of typical echellette spectrometers. We then 
add Gaussian white noise to each pixel with variance 
determined by the relation S/N = F/ai^, where F is 
the mean transmitted flux. This then gives an average 
signal-to-noise ratio equal to the desired value. 

As we already discussed in § 14.41 in the context of ther- 
mal broadening, phase angles are invariant under a con- 
volution with a symmetric Gaussian kernel. Thus we do 
not expect spectral resolution to significantly influence 
our results, provided that we restrict attention to modes 
which are marginally resolved, such that we can measure 
their phases. Indeed, the cutoff in the flux power spec- 
trum induced by spectral resolution is kres = 1/cres ~ 
2.358/FWHM = 0.08 s km"\ is comparable to the maxi- 
mum wavenumber we consider k = 0.1s km~^, and hence 
we satisfy this criteria. Note further that this invariance 
to a symmetric spectral convolution implies that we do 
not need to be able to precisely model the resolution, 
provided that it has a nearly symmetric shape and does 
not vary dramatically across the spectrum. This is an- 
other significant advantage of the phase angle approach, 
since the resolution of a spectrometer often depends on 
the variable seeing, and can be challenging to accurately 
calibrate. 

Although our results are thus likely to be very indepen- 
dent of resolution, noise introduces fluctuations which 



are uncorrelated between the two sightlines, and this will 
tend to reduce the coherence of the flux that the phase 
angle PDF quantifies. Noise will thus modify the shape 
of the phase angle PDF away from the intrinsic shape 
shown in Figure [5l In order to deal with noise and its 
confluence with spectral resolution, we adopt a forward- 
modeling approach. Speciflcally, for each thermal model 
we smooth all 10,000 IGM skewers to finite resolution, in- 
terpolate onto coarser spectral grids, and add noise con- 
sistent with our desired signal-to-noise ratio. We then fit 
the resulting distribution of phase angles to the wrapped- 
Cauchy distribution, determining the value of the con- 
centration parameter C(A:, rj,), at each k and r± as we did 
before. We again emulate the function ({k,r±\To,-f, Xj) 
using the same thermal parameter grid, but now with 
noise and spectral resolution included, enabling fast eval- 
uations of the likelihood in eqn. (1251) . Thermal parameter 
constraints then follow from MCMC exploration of this 
new likelihood, for which the impact of noise and resolu- 
tion on the phase angle PDF have been fully taken into 
account. 

In Figure [T^] we show the impact of noise on the fully 
marginalized constraints on the Jeans scale from the 
phase angle PDF. The solid curve represents the pos- 
terior distribution for a mock dataset with infinite reso- 
lution and signal-to-noise ratio, which is identical to the 
red curve in Figure 1111 The dotted and dashed curves 
illustrate the impact of S/N =10 and S/N = 5, respec- 
tively. Note that the slight shift in the modes of these 
distributions from the fiducial value are expected, and 
should not be interpreted as a bias. Different noise re- 
alizations generate scatter in the phase angles just like 
the intrinsic noise from large-scale structure. The in- 
ferred Jeans scale for any given mock dataset or noise 
realization will not be exactly equal to the true value, 
but should rather be distributed about it with a scatter 
given by the width of the resulting posterior distribu- 
tions. The relative shifts in the mode of the posterior 
PDFs are well within la of the fiducial value, and are 
thus consistent with our expectations. 

The upshot of Figure [T2] is that noise and limited spec- 
tral resolution do not have a significant impact on our 
ability to measure the Jeans scale. For a signal-to-noise 
ratio of S/N = 10 per pixel we find that the relative 
precision with which we can measure the Jeans scale is 
""a/Aj = 4.8%, which is only a slight degradation from 
the precision achievable from the same dataset at infi- 
nite signal-to-noise ratio and resolution a\/\j — 3.9%. 
The small impact of noise on the Jeans scale precision 
is not surprising. For the 10 km s~^ spectral pixels that 
we simulate, the standard deviation of the normalized 
Lya forest flux per pixel is -y/ {SF"^) ~ 32%, whereas for 
S/N ~ 10 our Gaussian noise fluctuations are at a sig- 
nificantly smaller ~ 10% level. Heuristically, these two 
'noise' sources add in quadrature, and thus the primary 
source of 'noise' in measuring the phase angle PDF re- 
sults from the Lya forest itself, rather than from noise in 
the data. For a lower signal-to- noise ratio of S /N — 5 per 
pixel, the precision is further degraded to (J\/Xj = 7.2%, 
which refiects the fact that noise fluctuations are becom- 
ing more comparable to the intrinsic Lya forest fluctua- 
tions. 

These numbers on the scaling of our precision with 
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Fig. 13. — The effect of overestimating the signal-to-noise ratio 
by a 20% factor (red, dashed line) when the real value is S/N= 10: 
we do not find any significant bias on the measured value of the 
Jeans scale. 

signal-to- noise ratio S /N provide intuition about the op- 
timal observing strategy. For a given sample of pairs, it 
will require four times more exposure time to increase the 
signal-to-noise ratio from S/N ~ 5 to S/N ~ 10, whereas 
the same telescope time allocation could be used to in- 
crease the sample size by a factor of four at the same 
signal-to-noise ratio (assuming sufficient close pair sight- 
lines exist). For the latter case of an enlarged sample, the 
precision will scale roughly as oc yj iVpairs , implying a (Ta / 
Aj = 3.6% for a sample of 80 pairs observed at S/N — 5. 
This can be compared to cr\/\,j — 4.8% for 20 pairs 
observed at S/N ~ 10. There is thus a marginal gain 
in working at lower S/N ~ 5 and observing a larger pair 
sample, although we have not considered various system- 
atic errors which could impact our measurement. How- 
ever, higher signal-to-noise spectra are usually preferable 
for the purposes of mitigating systematics, and hence 
one would probably opt for higher signal-to-noise ratio, 
a smaller pair sample, and tolerate slightly higher statis- 
tical errors. 

6.5. Systematic Errors 

We now briefly discuss the systematic errors which 
could impact a measurement of the Jeans scale. First, 
consider the impact of errors in the continuum normal- 
ization. Because the phase angle is a ratio of Fourier 
modes of the normalized flux eqn. (|16|) . it is completely 
insensitive to the continuum normalization of 5F , pro- 
vided that the continuum is not adding significant power 
on the scale of wavelength of the fc-mode considered. In 
the previous section, we argued that finite spectral reso- 
lution does not have a significant impact the phase angle 
PDF, because phase angles are invariant under convo- 
lutions with symmetric kernels. We do take resolution 
into account in our forward-modeling of the phase angle 
PDF, but precise knowledge of the spectral resolution or 
the line spread function is not required, since the line 



spread function will surely be symmetric when averaged 
over several exposures, thus leaving the phase angles in- 
variant. The only requirement is that we restrict atten- 
tion to modes less than the resolution cutoff k < k-^cs 
whose amplitudes are not significantly attenuated, such 
that we can actually measure their phase angles. 

Noise does modify the phase angle PDF, but our 
forward-modeling approach takes this fully into account 
provided the noise estimates are correct. One potential 
systematic is uncertainty in the noise model. The typical 
situation is that the standard-deviation of a spectrum re- 
ported by a data reduction pipeline is underestimated at 
the 10 — 20% level (S/N overestimated), because of sys- 
tematic error s related to the instrument and data re duc- 
tion (see e.g. iMcDonald etal|[200l iLee et al.|[20T3l ). To 
address this issue we directly model the impact of under- 
estimated noise for a case where we think the S/N ~ 10, 
but where in reality it is actually 20% lower S/N ~ 8. 
Specifically, using our same mock dataset we generate 
20 quasar pair spectra with S/N ~ 8. However, when 
forward-modeling the phase angle PDF with the IGM 
simulations, we take the signal-to-noise ratio to be the 
overestimated value of S/N ~ 10. Excess noise above 
our expectation would tend to reduce the coherence in 
the spectra (less peaked phase angle PDF) mimicking 
the effect of a smaller Jeans scale. We thus expect a 
bias in the Jeans scale to result from the underestimated 
noise. Figure [T51 compares the posterior distributions of 
the Jeans scale for the two cases S/N ~ 10 (black curve) 
and signal-to- noise ratio overestimated to be S/N ~ 10 
but actually equal to S/N ~ 8 (red curve). We see that 
~ 20% level uncertainties in the noise lead to a negligible 
bias in the Jeans scale. 

The only remaining systematic that could impact the 
Jeans scale measurement is metal-line absorption within 
the forest. Metal absorbers cluster differently from the 
IGM, and it is well known that metals add high-fc power 
to the Lya forest power spectrum because the gas traced 
by metal lines tends to b e colder than H I in the 
IGM (McDonald ct al. 2000; Croft et al. 2002; Kim et al.l 
'2004!: lLidz et al.ll2009f ). As this metal absorption is not 
present in our IGM simulations, it can lead to discrep- 
ancies between model phase angle PDFs and the actual 
data, resulting in a biased measurement. This is very 
unlikely to be a significant effect. We restrict attention 
to large scale modes with k < 0.1 s km~^, both because 
this is comparable to our expected spectral resolution 
cutoff, and because below these wavenumbers metal line 
absorption results in negligible contamina tion of the lon- 
gitudinal power (McDonald et al. 2000; Cr oft et al.ll2002l: 
IKim et all 120041: ILidz et al.l 120091) . Since the metal ab- 
sorbers have a negligible effect on the moduli of these 
large scale modes, we also expect them to negligibly 
change their phase angles. 

We thus conclude that the phase angle PDF is highly 
insensitive to the systematics that typically plague Lya 
forest measurements, such as continuum fitting errors, 
lack of knowledge of spectral resolution, poorly cali- 
brated noise, and metal line absorption. 

6.6. Is Our Likelihood Estimator Unbiased? 

Finally, we determine whether our procedure for mea- 
suring the Jeans scale via the phase angle likelihood 
(eqn. [25]) outlined at the end of § 15.31 produces unbiased 
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estimates. To quantify any bias in our Jeans scale esti- 
mator we follow a Monte Carlo approach, and generate 
400 distinct quasar pair samples by randomly drawing 
20 quasar pair spectra (allowing for repetition) from our 
ensemble of 10,000 skewers. Note that the distribution 
of transverse separations is approximately the same for 
all of these realizations, since we only simulate 30 dis- 
crete separations, and the full sample of 20 overlapping 
pair spectra requires 200 pairs of skewers, which are ran- 
domly selected from among the 30 available pair separa- 
tions. We MCMC sample the likelihood in eqn. ([25]) for 
each realization, and thus generate the full marginalized 
posterior distribution (eqn. [551 red curve in Figure [TT|) . 
The 'measured' value of the Jeans scale for each realiza- 
tion is taken to the be the mean of the posterior distribu- 
tion. We conducted this procedure for the case of finite 
spectral resolution (FWHM = 30 km s~^) and signal-to- 
noise ratio S/N ~ 5, where our forward-modeling proce- 
dure described in § 16.41 is used to model the impact of 
resolution and noise on the phase angle PDF. 

The distribution of Jeans scale measurements resulting 
from this Monte Carlo simulation is shown in Figure [TH 
We find that the distribution of 'measurements' is well 
centered on the true value of Aj = 110 kpc, and the mean 
value of this distribution is Aj = 111.1 kpc, which dif- 
fers from the true value by only 1%, confirming that our 
procedure is unbiased to a very high level of precision. 
The relative error of our measurements from this Monte 
Carlo simulation is (J\j/Xj = 6.3%, which is consistent 
with the value of a\j/\j — 7.2%, which we deduced in 
S I6.3l from an MCMC sampling of the likelihood for a sin- 
gle mock dataset. This confirms that the posterior dis- 
tributions derived from our MCMC do indeed provide an 
accurate representation of the errors on the Jeans scale 
and other thermal parameters. However, we note that 
there is some small variation in the value of <J\,/\j in- 
ferred from the posterior distributions for different mock 
data realizations, as expected. Given that we only gen- 
erated 400 samples, the error on our determination of 
the mean of the distribution in Figure [14] is ~ axj/Xj / 
\/400 = 0.3%, and thus our shght bias of 1% constitutes 
a ^ 3ct fluctuation. We suspect that this is too large to 
be a statistical fluke, and speculate that a tiny amount 
of bias could be resulting from interpolation errors in our 
emulation of the IGM. It is also possible that choosing 
an alternative statistic of the posterior distribution as 
our 'measurement' instead of the mean, for example the 
mode or median, could also further reduce the bias. But 
we do not consider this issue further, since the bias is so 
small compared to our expected precision. 

We conclude that our phase angle PDF likelihood pro- 
cedure for estimating the Jeans scale has a negligible 
~ 1% bias. We would need to analyze a sample of 
~ 500 — 1000 quasar pair spectra for this bias to be com- 
parable to the error on the Jeans scale. Furthermore, 
it is likely that we could, if necessary, reduce this bias 
even further by either reducing the interpolation error in 
our emulator or by applying a different statistic to our 
posterior distribution to determine the measured value. 

7. DISCUSSION AND SUMMARY 

Spectra of correlated Lya forest absorption in close 
quasar pair sightlines represent a unique opportunity to 
improve our understanding of the physics governing the 
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Fig. 14. — Probability distribution of the measured value of A j 
for 400 different mock datasets drawn from the fiducial simulation. 
This plot confirms that our method is not biased, since the dis- 
tributions is be centered at the true value, marked with a vertical 
dashed line. This test is performed assuming S/N= 5. The red line 
is the posterior distribution deduced from our MCMC sampling of 
the phase angle PDF likelihood for one of these 400 mock dataset 
realizations. Its similarity in shape to the distribution of mock 
measurements illustrates that our MCMC simulations provide re- 
liable error estimates. 

IGM. In this paper we have shown that the degree of 
coherence of Lya absorption in quasar pair spectra is 
sensitive to the Jeans filtering scale, provided the pair 
separation is small enough to resolve it. Although the 
Jeans scale has never been measured, it has fundamental 
cosmological implications: it provides a thermal record of 
heat injected by ultraviolet photons during cosmic reion- 
ization events, determines the dumpiness of the IGM, a 
critical ingredient in reionization models, and sets the 
minimum mass of galaxies to gravitationally collapse 
from the IGM. 

We introduce a novel technique to directly measure 
the Jeans scale from quasar pair spectra based on the 
probability distribution function (PDF) of phase angle 
differences of homologous longitudinal Fourier modes in 
close quasar pair spectra. To study the efficacy of this 
new method, we combined a semi-analytical model of the 
Lya forest with a dark matter only simulation, to gener- 
ate a grid of 500 thermal models, where the temperature 
at mean density Tq, slope of the temperature-density re- 
lation 7, and the Jeans scale Aj were varied. A Bayesian 
formalism is introduced based on the phase angle PDF, 
and MCMC techniques are used to conduct a full param- 
eter study, allowing us to characterize the precision of a 
Jeans scale measurement, explore degeneracies with the 
other thermal parameters, and compare parameter con- 
straints with those obtained from other statistics such as 
the longitudinal power and the cross-power spectrum. 

The primary conclusions of this study are: 

• The longitudinal power is highly degenerate with 
respect to the thermal parameters Tq, 7 and Aj, 
which arises because thermal broadening smooths 



25 



the IGM along the hne-of-sight (ID) at a compa- 
rable scale as the Jeans pressure smoothing (3D). 
It is extremely challenging to disentangle this con- 
fluence of ID and 3D smoothing with longitudinal 
observations alone. Similar analogous degenera- 
cies are likely to exist in other previously consid- 
ered statistics sensitive to small-scale power such 
as the wavelet decomposition, the curvature, the 
6-parameter distribution, and the flux PDF. Hence 
it may be necessary to reassess the reliability and 
statistical significance of previous measurements of 
To and 7. 

• The cross-power measured from close quasar pairs 
is sensitive to the 3D Jeans smoothing, and can 
break degeneracies with the unknown Jeans scale. 
However, it is not the optimal statistic, because it 
mixes ID information in the moduli of longitudinal 
Fourier modes, with the 3D information encoded in 
their phase differences. We show that by focusing 
on the phase differences alone, via the full PDF 
of phase angles, one is much more sensitive to 3D 
power and the Jeans smoothing. 

• Based on a simple heuristic geometric argument, 
we derived an analytical form for the phase an- 
gle PDF. A single parameter family of wrapped- 
Cauchy distributions provides a good fit to the 
phase differences in our simulated spectra for any 
k, rj_, the full range of To,7 and Aj. 

• Our phase angle PDFs indicate that phase differ- 
ences between large-scale longitudinal modes with 
small wavenumbers k <^ 1/Aj, are nevertheless 
very sensitive to the Jeans scale. We present a sim- 
ple analytical argument showing that this sensitiv- 
ity results from the geometry of observing a 3D field 
along ID skewers: low-A: cross-power across corre- 
lated ID skewers is actually dominated by high-Zc 
3D modes up to a scale set by the pair separation 
k^ ^ 

• The phase angle PDF is essentially independent 
of the temperature-density relation parameters Tg 
and 7. This results because 1) the non-linear 
FGPA transformation is only weakly dependent on 
temperature 2) phase angles of longitudinal modes 
are invariant to the symmetric thermal broadening 
convolution. 

• Our full Bayesian MCMC parameter analysis indi- 
cates that a realistic sample of only 20 close quasar 
pair spectra observed at modest signal-to-noise ra- 
tio S/N ~ 10, can pinpoint the Jeans scale to ~ 5% 
precision, fully independent of the amplitude Tq 
and slope 7 of the temperature-density relation. 
The freedom from degeneracies with Tq and 7 is 
a direct consequence of the near independence of 
the phase angle PDF of these parameters. 



• Our new estimator for the Jeans scale is unbiased 
and insensitive to a battery of systematics that typ- 
ically plague Lya forest measurements, such as con- 
tinuum fitting errors, imprecise knowledge of the 
noise level and/or spectral resolution, and metal- 
line absorption. 
In order for the parameter study presented here, with 
a large grid (500) of thermal models, to be computation- 
ally feasible, we had to rely on a simplified model of the 
IGM, based on a dark-matter only simulation and sim- 
ple thermal scaling relations. In particular, the impact of 
Jeans pressure smoothing on the distribution of baryons 
is approximated by smoothing the dark-matter particle 
distribution with a Gaussian-like kernel, and we allowed 
the three thermal parameters Tq, 7, and Xj to vary com- 
pletely independently. Although the Gau ssian filtering 
appro ximation is valid in linear theory (jGnedin et al.l 
120031 ). the Jeans scale is highly nonlinear at 2; ~ 3, hence 
a precise description of how pressure smoothing alters the 
3D power spectrum of the baryons requires full hydrody- 
namical simulations. Furthermore, the three thermal pa- 
rameters we consider are clearly implicitly correlated by 
the underlying thermal history of the Universe. Indeed, 
a full treatment of the impact of impulsive reionization 
heating on the thermal evolution of the IGM and the con- 
comitant hydrodynamic response of the baryons, proba- 
bly requires coupled radiative transfer hydrodynamical 
simulations. 

Our approximate IGM model is thus justified by the 
complexity and computational cost of fully modeling 
the Jeans smoothing problem, and despite its simplic- 
ity, it provides a good fit to current measurements of 
the longitudinal power (see Figure [2]). Most impor- 
tantly, our simple model allowed us to develop valuable 
physical intuition about how 3D pressure smoothing of 
baryons is manifest in Lya forest spectra of close quasar 
pairs. Based on this intuition, we devised a powerful 
new method which isolates this small-scale 3D informa- 
tion. By combining this new technique with existing 
close quasar spectra, we will make the first direct mea- 
surement of the Jeans scale of the IGM. Given that pre- 
cise ~ 5% constraints on the Jeans scale will soon be 
available, the time is now ripe to use hydrodynamical 
and radiative transfer simulations to improve our under- 
standing of how reionization heating altered the small- 
scale structure of baryons in the IGM. 



We thank P. McDonald and U. Seljak for first sug- 
gesting to JFH that close quasar pairs could be used to 
measure the Jeans scale. We also thank the members 
of the ENIGMA grouifl at the Max Planck Institute for 
Astronomy (MPIA) for reading an early version of the 
manuscript and for helpful discussions. JFH acknowl- 
edges generous support from the Alexander von Hum- 
boldt foundation in the context of the Sofja Kovalevskaja 
Award. The Humboldt foundation is funded by the Ger- 
man Federal Ministry for Education and Research. 



REFERENCES 



Abel, T., & Haehnelt, M. G. 1999, ApJ, 520, L13 
Barkana, R., & Loeb, A. 1999, ApJ, 523, 54 

http://www.mpia-hd.mpg.de/ENIGMA/ 



— . 2001, Phys. Rep., 349, 125 
Becker, G. D., Bolton, J. S., Haehnelt, M. G., . 
W. L. W. 2011, MNRAS, 410, 1096 



Sargent, 



26 



Benson, A. J., Frcnk, C. S., Laccy, C. G., Baugh, C. M., & Cole, 

S. 2002a, MNRAS, 333, 177 
Benson, A. J., Laccy, C. G., Baugh, C. M., Cole, S., & Frenk, 

C. S. 2002b, MNRAS, 333, 156 
Benson, A. J., & Madau, P. 2003, MNRAS, 344, 835 
Bolton, J., Meiksin, A., & White, M. 2004, MNRAS, 348, L43 
Bolton, J. S., Viel, M., Kim, T., Haehnelt, M. G., & Carswell, 

R. F. 2008, MNRAS, 386, 1131 
Bovy, J., Hennawi, J. F., Hogg, D. W., Myers, A. D., Kirkpatrick, 

J. A., Schlegel, D. J., Ross, N. P., Sheldon, E. S., McGreer, 

I. D., Schneider, D. P., & Weaver, B. A. 2011, ApJ, 729, 141 
Bovy, J., Myers, A. D., Hennawi, J. F., Hogg, D. W., McMahon, 

R. G., Schiminovich, D., Sheldon, E. S., Brinkmann, J., 

Schneider, D. P., & Weaver, B. A. 2012, ApJ, 749, 41 
Broderick, A. E., Chang, P., & Pfrommer, C. 2012, ApJ, 752, 22 
Bryan, G. L., & Machacek, M. E. 2000, ApJ, 534, 57 
Bullock, J. S., Kravtsov, A. V., & Weinberg, D. H. 2000, ApJ, 

539, 517 

Calura, F., Tescari, E., D'Odorico, V., Viel, M., Cristiani, S., 

Kim, T.-S., & Bolton, J. S. 2012, MNRAS, 422, 3019 
Ccn, R., & Bryan, G. L. 2001, ApJ, 546, LSI 

Cen, R., McDonald, P., Trac, H., & Loeb, A. 2009, ArXiv e-prints 
Cen, R., Miralda-Escude, J., Ostriker, J. P., & Ranch, M. 1994, 

ApJ, 437, L9 
Cen, R., & Ostriker, J. P. 1999, ApJ, 514, 1 

— . 2006, ApJ, 650, 560 

Chang, P., Broderick, A. E., & Pfrommer, C. 2012, ApJ, 752, 23 
Chiang, L.-Y., Coles, P., & Nasclsky, P. 2002, MNRAS, 337, 488 
Ciardi, B., & Ferrara, A. 2005, Space Sci. Rev., 116, 625 
Ciardi, B., & Salvatcrra, R. 2007, MNRAS, 381, 1137 
Coles, P. 2009, in Lecture Notes in Physics, Berlin Springer 

Vcrlag, Vol. 665, Data Analysis in Cosmology, ed. V. J. 

Martinez, E. Saar, E. Martinez-Gonzalez, & M.-J. 

Pons-Bordcn'a, 493-522 
Croft, R. A. C, Weinberg, D. H., Bolte, M., Buries, S., Hernquist, 

L., Katz, N., Kirkman, D., & Tytler, D. 2002, ApJ, 581, 20 
Croft, R. A. C, Weinberg, D. H., Katz, N., & Hernquist, L. 1997, 

ApJ, 488, 532 
— . 1998, ApJ, 495, 44 

Dave, R., Cen, R., Ostriker, J. P., Bryan, G. L., Hernquist, L., 
Katz, N., Weinberg, D. H., Norman, M. L., & O'Shea, B. 2001, 
ApJ, 552, 473 

Dave, R., Hernquist, L., Katz, N., & Weinberg, D. H. 1999, ApJ, 
511, 521 

Dijkstra, M., Haiman, Z., Rees, M. J., & Weinberg, D. H. 2004, 
ApJ, 601, 666 

D'Odorico, V., Viel, M., Saitta, F., Cristiani, S., Bianchi, S., 
Boyle, B., Lopez, S., Maza, J., & Outram, P. 2006, MNRAS, 

372, 1333 

Embcrson, J. D., Thomas, R. M., & Alvarez, M. A. 2013, ApJ, 

763, 146 

Fan, X., Carilli, C. L., & Keating, B. 2006, ARA&A, 44, 415 
Faucher-Gigucrc, C. ., Prochaska, J. X., Lidz, A., Hernquist, L., 

& Zaldarriaga, M. 2007, ArXiv c-prints, 709 
Foreman-Mackcy, D., Hogg, D. W., Lang, D., & Goodman, J. 

2012, ArXiv e-prints 
Furlanctto, S. R., & Dixon, K. L. 2010, ApJ, 714, 355 
Furlanctto, S. R., & Oh, S. P. 2009, ApJ, 701, 94 
Garzilli, A., Bolton, J. S., Kim, T.-S., Leach, S., & Viel, M. 2012, 

MNRAS, 424, 1723 
Gncdin, N. Y. 2000a, ApJ, 542, 535 
— . 2000b, ApJ, 542, 535 

Gnedin, N. Y., Baker, E. J., Bcthcll, T. J., Drosback, M. M., 
Harford, A. G., Hicks, A. K., Jensen, A. G., Kccncy, B. A., 
Kelso, C. M., Neyrinck, M. C, Pollack, S. E., & van Vliet, 
T. P. 2003, ApJ, 583, 525 
Gnedin, N. Y., & Hui, L. 1998, MNRAS, 296, 44 
Gunn, J. E., & Peterson, B. A. 1965, ApJ, 142, 1633 
Haardt, F., & Madau, P. 2012, ApJ, 746, 125 
Habib, S., Heitmann, K., Higdon, D., Nakhleh, C, & Williams, 

B. 2007, Phys. Rev. D, 76, 083503 
Haehnelt, M. G., & Steinmetz, M. 1998, MNRAS, 298, L21 
Haiman, Z., Abel, T., & Madau, P. 2001, ApJ, 551, 599 
Heitmann, K., Higdon, D., Nakhleh, C, & Habib, S. 2006, ApJ, 
646, LI 

Heitmann, K., Higdon, D., White, M., Habib, S., Williams, B. J., 
Lawrence, E., & Wagner, C. 2009, ApJ, 705, 156 



Heitmann, K., Lukic, Z., Fasel, P., Habib, S., Warren, M. S., 
White, M., Ahrcns, J., Ankeny, L., Armstrong, R., O'Shea, B., 
Ricker, P. M., Springel, V., Stadcl, J., & Trac, H. 2008, 
Computational Science and Discovery, 1, 015003 

Heitmann, K., White, M., Wagner, C, Habib, S., & Higdon, D. 
2010, ApJ, 715, 104 

Hennawi, J. F. 2004, PhD thesis, Ph.D dissertation, 2004. 232 
pages; United States - New Jersey: Princeton University; 
2004. Publication Number: AAT 3151085. DAI-B 65/10, 
p. 5189, Apr 2005 

Hennawi, J. F., Myers, A. D., Shen, Y., Strauss, M. A., 

Djorgovski, S. G., Fan, X., Glikman, E., Mahabal, A., Martin, 

C. L., Richards, G. T., Schneider, D. P., & Shankar, F. 2009, 
ArXiv e-prints 

Hennawi, J. F., & Prochaska, J. X. 2007, ApJ, 655, 735 
— . 2008, ArXiv Astrophysics e-prints 

Hennawi, J. F., Prochaska, J. X., Buries, S., Strauss, M. A., 
Richards, G. T., Schlegel, D. J., Fan, X., Schneider, D. P., 
ZaJcamska, N. L., Oguri, M., Gunn, J. E., Lupton, R. H., & 
Brinkmann, J. 2006a, ApJ, 651, 61 
Hennawi, J. F., Strauss, M. A., Oguri, M., Inada, N., Richards, 
G. T., Pindor, B., Schneider, D. P., Becker, R. H., Gregg, 
M. D., Hall, P. B., Johnston, D. E., Fan, X., Buries, S., 
Schlegel, D. J., Gunn, J. E., Lupton, R. H., Bahcall, N. A., 
Brunncr, R. J., & Brinkmann, J. 2006b, AJ, 131, 1 
Hoeft, M., Ycpcs, G., Gottlobcr, S., & Springel, V. 2006, 

MNRAS. 371. 401 
Hui, L., & Gncdin, N. Y. 1997, MNRAS, 292, 27 
Hui, L., & Haiman, Z. 2003, ApJ, 596, 9 
Hui, L., Stebbins, A., & Buries, S. 1999, ApJ, 511, L5 
Iliev, I. T., Mellema, G., Pen, U.-L., Merz, H., Shapiro, P. R., & 

Alvarez, M. A. 2006, MNRAS, 369, 1625 
Ilicv, I. T., Scannapicco, E., & Shapiro, P. R. 2005, ApJ, 624, 491 
Inoue, A. K., & Kaniaya, H. 2003, MNRAS, 341, L7 
Jakobsen, P., Boksenberg, A., Deharveng, J. M., Greenfield, P., 

Jedrzejewski, R., & Paresce, F. 1994, Nature, 370, 35 
Jammalamadaka, S. R., & Scngupta, A. 2001 
Kim, T.-S., Bolton, J. S., Viel, M., Haehnelt, M. G., & Carswell, 

R. F. 2007, MNRAS, 382, 1657 
Kim, T.-S., Viel, M., Haehnelt, M. G., Carswell, R. F., & 

Cristiani, S. 2004, MNRAS, 347, 355 
KoUmeier, J. A., Miralda-Escude, J., Cen, R., & Ostriker, J. P. 

2006, ApJ, 638, 52 
Kuhlen, M., & Madau, P. 2005, MNRAS, 363, 1069 
Kulkarni, G., & Choudhury, T. R. 2011, MNRAS, 412, 2781 
Kwan, J., Bhattacharya, S., Heitmann, K., & Habib, S. 2012, 
ArXiv e-prints 

Lawrence, E., Heitmann, K., White, M., Higdon, D., Wagner, C, 

Habib, S., & Williams, B. 2010, ApJ, 713, 1322 
Lcc, K.-G. 2012, ApJ, 753, 136 

Lee, K.-G., Bailey, S., Bartsch, L. E., Carithers, W., Dawson, 
K. S., Kirkby, D., Lundgren, B., Margala, D., 
Palanquc-Dclabrouillc, N., Pieri, M. M., Schlegel, D. J., 
Weinberg, D. H., Ycchc, C, Aubourg, E., Bautista, J., Bizyaev, 

D. , Bloniqvist, M., Bolton, A. S., Borde, A., Brcwington, H., 
Busca, N. G., Croft, R. A. C, Delubac, T., Ebclkc, G., 
Eiscnstcin. D. J., Font-Ribera, A., Ge, J., Hamilton, J.-C, 
Hennawi, J. F., Ho, S., Honscheid, K., Le Goff, J.-M., 
Malanuslicnko, E., Malanushcnko, V., Miralda-Escude, J., 
Myers, A. D., Notcrdacmc, P., Oravctz, D., Pan, K., Paris, L, 
Petitjean, P., Rich, J., RoUinde, E., Ross, N. P., Rossi, G., 
Schneider, D. P., Simmons, A., Sncddcn, S., Slosar, A., Spcrgel, 
D. N., Suzuki, N., Viol, M., & Weaver, B. A. 2013, AJ, 145, 69 

Lidz, A., Faucher-Gigucrc, C, Dall'Aglio, A., McQuinn, M., 
Fcclmer, C, Zaldarriaga, M., Hernquist, L., & Dutta, S. 2009, 
ArXiv e-prints 

Lumsden, S. L., Heavens, A. F., & Peacock, J. A. 1989, MNRAS, 
238, 293 

Madau, P. 2000, in Royal Society of London Philosophical 
Transactions Scries A, Vol. 358, Astronomy, physics and 
chemistry of HJ, 2021 
Madau, P., & Efstathiou, G. 1999, ApJ, 517, L9 
Madau, P., Ferrara, A., & Rccs, M. J. 2001, ApJ, 555, 92 
Madau, P., Haardt, F., & Rccs, M. J. 1999, ApJ, 514, 648 
Madau, P., & Meiksin, A. 1994, ApJ, 433, L53 



27 



McDonald, P., Miralda-Escude, J., Rauch, M., Sargent, W. L. W., 
Barlow, T. A., & Cen, R. 2001, ApJ, 562, 52 

McDonald, P., Miralda-Escude, J., Rauch, M., Sargent, W. L. W., 
Barlow, T. A., Cen, R., & Ostriker, J. P. 2000, ApJ, 543, 1 

McDonald, P., Seljak, U., Buries, S., Schlegel, D. J., Weinberg, 
D. H., Cen, R., Shih, D., Schaye, J., Schneider, D. P., Bahcall, 
N. A., Briggs, J. W., Brinkmann, J., Brunner, R. J., Fukugita, 
M., Gunn, J. E., Ivezic, v., Kent, S., Lupton, R. H., & Vanden 
Berk, D. E. 2006, ApJS, 163, 80 

McGill, C. 1990, MNRAS, 242, 544 

McQuinn, M., Lidz, A., Zaldarriaga, M., Hernquist, L., Hopkins, 
P. F., Dutta, S., & Faucher-Giguere, C.-A. 2009, ApJ, 694, 842 

McQuinn, M., Oh, S. P., & Faucher-Giguere, C.-A. 2011, ApJ, 
743, 82 

Meiksin, A., & Tittley, E. R. 2012, MNRAS, 423, 7 
Meiksin, A., & White, M. 2001, MNRAS, 324, 141 
Meiksin, A. A. 2009, Reviews of Modern Physics, 81, 1405 
Miralda-Escude, J., Cen, R., Ostriker, J. P., & Rauch, M. 1996, 
ApJ, 471, 582 

Miralda-Escude, J., Haehnelt, M., & Rees, M. J. 2000, ApJ, 530, 1 
Miralda-Escude, J., & Rees, M. J. 1994, MNRAS, 266, 343 
Myers, A. D., Richards, G. T., Brunner, R. J., Schneider, D. P., 

Strand, N. E., Hall, P. B., Blomquist, J. A., & York, D. G. 

2008, ApJ, 678, 635 
Nath, B. B., & Biermann, P. L. 1993, MNRAS, 265, 241 
Nath, B. B., Sethi, S. K., k Shchekinov, Y. 1999, MNRAS, 303, 1 
Okamoto, T., Gao, L., & Theuns, T. 2008, MNRAS, 390, 920 
Parsons, A. R., Liu, A., Aguirre, J. E., Ali, Z. S., Bradley, R. F., 

Carilli, C. L., DeBoer, D. R., Dexter, M. R., Gugliucci, N. E., 

Jacobs, D. C, Klima, P., MacMahon, D. H. E., Manley, J. R., 

Moore, D. F., Pober, J. C, Stefan, I. I., & Walbrugh, W. P. 

2013, ArXiv e-prints 
Pawlik, A. H., Schaye, J., & van Schcrpcnzcel, E. 2009, MNRAS, 

394, 1812 

Peacock, J. A. 1999, Cosmological physics; rev. version 

(Cambridge: Cambridge Univ.) 
Peeples, M. S., Weinberg, D. H., Dave, R., Fardal, M. A., & Katz, 

N. 2009a, ArXiv e-prints 
— . 2009b, ArXiv e-prints 

Petry, C. E., Impey, C. D., & Foltz, C. B. 1998, ApJ, 494, 60 
Pfrommer, C, Chang, P., & Broderick, A. E. 2012, ApJ, 752, 24 
Prochaska, J. X., & Hennawi, J. F. 2009, ApJ, 690, 1558 
Prochaska, J. X., Hennawi, J. F., & Simcoe, R. A. 2012, ArXiv 
e-prints 

Puchwein, E., Pfrommer, C, Springel, V., Broderick, A. E., & 

Chang, P. 2012, MNRAS, 423, 149 
Rauch, M. 1998, ARA&A, 36, 267 

Rauch, M., Sargent, W. L. W., Barlow, T. A., & Carswell, R. F. 
2001, ApJ, 562, 76 

Reimers, D., Kohler, S., Wisotzki, L., Groote, D., 

Rodriguez-Pascual, P., & Wamsteker, W. 1997, A&A, 327, 890 

Richards, G. T., Nichol, R. C, Gray, A. G., Brunner, R. J., 
Lupton, R. H., Vanden Berk, D. E., Chong, S. S., Weinstein, 
M. A., Schneider, D. P., Anderson, S. F., Munn, J. A., Harris, 
H. C, Strauss, M. A., Fan, X., Gunn, J. E., Ivezic, v., York, 
D. G., Brinkmann, J., & Moore, A. W. 2004, ApJS, 155, 257 

Ricotti, M., Gnedin, N. Y., & ShuU, J. M. 2000, ApJ, 534, 41 

Ricotti, M., Ostriker, J. P., & Gnedin, N. Y. 2005, MNRAS, 357, 
207 

Rudie, G. C, Steidel, C. C, & Pettini, M. 2012, ApJ, 757, L30 
Scannapieco, E., Ferrara, A., & Madau, P. 2002, ApJ, 574, 590 
Scannapieco, E., & Oh, S. P. 2004, ApJ, 608, 62 
Schaye, J., Theuns, T., Rauch, M., Efstathiou, G., & Sargent, 

W. L. W. 2000, MNRAS, 318, 817 
ShuU, J. M., France, K., Danforth, C. W., Smith, B., &; 

Tumlinson, J. 2010, ApJ, 722, 1312 
Smette, A., Robertson, J. G., Shaver, P. A., Reimers, D., 

Wisotzki, L., & Koehler, T. 1995, A&AS, 113, 199 



Somerville, R. S. 2002, ApJ, 572, L23 

Syphers, D., Anderson, S. F., Zheng, W., Meiksin, A., Schneider, 

D. P., & York, D. G. 2012, AJ, 143, 100 
Tanaka, T., Perna, R., & Haiman, Z. 2012a, MNRAS, 425, 2974 
— . 2012b, MNRAS, 425, 2974 

Theuns, T., Mo, H. J., & Schaye, J. 2001, MNRAS, 321, 450 
Theuns, T., Schaye, J., &; Haehnelt, M. G. 2000, MNRAS, 315, 
600 

Theuns, T., Schaye, J., Zaroubi, S., Kim, T., Tzanavaris, P., &; 

Carswell, B. 2002a, ApJ, 567, L103 
Theuns, T., Zaroubi, S., Kim, T., Tzanavaris, P., &i Carswell, 

R. F. 2002b, MNRAS, 332, 367 
Tittley, E. R., & Meiksin, A. 2007a, MNRAS, 380, 1369 
— . 2007b, MNRAS, 380, 1369 

Viel, M., Bolton, J. S., & Haehnelt, M. G. 2009, MNRAS, 399, 
L39 

Viel, M., Matarrese, S., Mo, H. J., Haehnelt, M. G., & Theuns, T. 

2002, MNRAS, 329, 848 
Voit, G. M. 1996, ApJ, 465, 548 

Watts, P., Coles, P., & Melott, A. 2003, ApJ, 589, L61 
White, M. 2002, ApJS, 143, 241 

Worseck, G., Prochaska, J. X., McQuinn, M., Dall'Aglio, A., 
Fechner, C, Hennawi, J. F., Reimers, D., Richter, P., & 
Wisotzki, L. 2011, ApJ, 733, L24 

York, D. G., Adelman, J., Anderson, J. J. E., Anderson, S. F., 
Annis, J., Bahcall, N. A., Bakken, J. A., Barkhouser, R., 
Bastian, S., Berman, E., Boroski, W. N., Bracker, S., Briegel, 
C, Briggs, J. W., Brinkmann, J., Brunner, R., Buries, S., 
Carey, L., Carr, M. A., Castander, F. J., Chen, B., Colestock, 
P. L., Connolly, A. J., Crocker, J. H., Csabai, L, Czarapata, 
P. C, Davis, J. E., Doi, M., Dombeck, T., Eisenstein, D., 
EUman, N., Elms, B. R., Evans, M. L., Fan, X., Federwitz, 
G. R., Fiscelli, L., Friedman, S., Frieman, J. A., Fukugita, M., 
Gillespie, B., Gunn, J. E., Gurbani, V. K., de Haas, E., 
Haldeman, M., Harris, F. H., Hayes, J., Heckman, T. M., 
Hennessy, G. S., Hindsley, R. B., Holm, S., Holmgren, D. J., 
Huang, C.-h., Hull, C, Husby, D., Ichikawa, S.-L, Ichikawa, T., 
Ivezic, v., Kent, S., Kim, R. S. J., Kinney, E., Klaene, M., 
Kleinman, A. N., Kleinman, S., Knapp, G. R., Korienek, J., 
Kron, R. G., Kunszt, P. Z., Lamb, D. Q., Lee, B., Leger, R. F., 
Limmongkol, S., Lindenmeyer, C, Long, D. C, Loomis, C, 
Loveday, J., Lucinio, R., Lupton, R. H., MacKinnon, B., 
Mannery, E. J., Mantsch, P. M., Margon, B., McGehee, P., 
McKay, T. A., Meiksin, A., Merelli, A., Monet, D. G., Munn, 
J. A., Narayanan, V. K., Nash, T., Neilsen, E., Neswold, R., 
Newberg, H. J., Nichol, R. C, Nicinski, T., Nonino, M., Okada, 
N., Okamura, S., Ostriker, J. P., Owen, R., Pauls, A. G., 
Peoples, J., Peterson, R. L., Petravick, D., Pier, J. R., Pope, 

A. , Pordes, R., Prosapio, A., Rechenmacher, R., Quinn, T. R., 
Richards, G. T., Richmond, M. W., Rivetta, C. H., Rockosi, 
C. M., Ruthmansdorfer, K., Sandford, D., Schlegel, D. J., 
Schneider, D. P., Sekiguchi, M., Sergey, G., Shimasaku, K., 
Siegmund, W. A., Smee, S., Smith, J. A., Snedden, S., Stone, 
R., Stoughton, C, Strauss, M. A., Stubbs, C, SubbaRao, M., 
Szalay, A. S., Szapudi, I., Szokoly, G. P., Thakar, A. R., 
Tremonti, C, Tucker, D. L., Uomoto, A., Vanden Berk, D., 
Vogeley, M. S., Waddell, P., Wang, S.-i., Watanabe, M., 
Weinberg, D. H., Yanny, B., & Yasuda, N. 2000, AJ, 120, 1579 

Young, P., Sargent, W. L. W., Oke, J. B., & Boksenberg, A. 1981, 
ApJ, 249, 415 

Zaldarriaga, M. 2002, The Astrophysical Journal, 564, 153 
Zaldarriaga, M., Hui, L., & Tegmark, M. 2001, ApJ, 557, 519 
Zaroubi, S. 2013, in Astrophysics and Space Science Library, Vol. 
396, Astrophysics and Space Science Library, ed. T. Wiklind, 

B. Mobasher, & V. Bromm, 45 



APPENDIX 

A. RESOLVING THE JEANS SCALE WITH DARK-MATTER SIMULATIONS 

The Lya forest probes the structure of the very low density regions of the IGM, setting strict requirements on the 
resolution of our dark-matter only simulation. In particular, because our simulation is discrete in mass, each dark- 
matter particle represents a fixed amount of gas distributed according to the gravitational softening length and the size 
of the smoothing kernel that we use to represent Jeans smoothing (eqn. [5]). At very low densities, it is possible that 
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Fig. 15. — The probability distributions of the relative baryonic density A = p/p in our simulations. Each panel represent a different 
filtering scale Aj, which was used to smooth the dark matter density for the same three simulations, which have different mean inter-particle 
separations Al. When AZ is too large relative to Aj the IGM density is poorly resolved at low densities, and the PDF is not converged. 
Empirically, we find that a safe criterion for convergence is Al < Aj, which allow us to resolve Jeans scales down to 50 kpc with our 
Lbox = 50 Mpc and A''p = 1500'' simulation. 

a very large region is described by a single particle, and that most of this void region is left empty. This undesirable 

1 /3 

situation occurs when the mean inter-particle separation Al = L\^ox/Np , which defines the typical size of regions 
occupied by each particle, is much larger than the Jeans scale Aj, which is the minimum scale we want to resolve. 
Under such circumstances the density profile of skewers through our simulation cube will have many pixels which 
are nearly empty, because they have few or no neighboring particles. This insufficient sampling of the volume due to 
large mean inter-particle separation will then manifest itself through the appearance of artifacts in the volume- weighted 
probability distribution function (PDF) of the density. On the other hand, if the inter-particle separation is sufficiently 
small, the density field will be sufficiently sampled, and further decreasing the inter-particle separation will not alter 
the density PDF. Therefore we can define our resolution criteria for the mean inter-particle separation to be smaller 
than some multiple of the Jeans scale Al < aXj, where the exact value of this coefficient a is determined by checking 
that convergence is achieved in the density PDF. 

We estimate a by plotting the PDF of log(A) from our IGM skewers for a set of simulations with varying mean 
inter-particle separation, where A — p/p=l + Sis the density in units of the mean. The employed simulations have 
mean inter-particle separations = {86, 171, 653} kpc, corresponding to box sizes Lbox = {100, 250, 720} Mpc/h with 
iVp = {1500'^, 2048'^, 1800^} particles, respectively. In FigureHSlwe check for convergence using three different values of 
A J. The results indicate that a safe criterion for resolving the jeans scale is < Aj or a ~ 1. The simulation employed 
in this work has Lbox 50/i~^Mpc and iVp — 1500"^ particles, or a mean inter-particle separation of A^ — 48 kpc. 
This simulation thus allows us to study pressure smoothing down to a Jeans scale as small as ~ 50 kpc. Note however 
that the results of this paper rely on our estimation of the Jeans scale from various Lya forest statistics around the 
fiducial value of Aj = 110 kpc, so we are confident that the Jeans scale is resolved in our simulations and that our 
results are not impacted by resolution effects. 

B. DETERMINING THE CONCENTRATION PARAMETER ( OF THE WRAPPED-CAUCHY DISTRIBUTION 

For a given sample of phases {9} we employ a maximum-likelihood algorithm to determine the best-fit concentra- 
tion parameter C, which uniquely sp ecifies a wrapped- Cauchy distribution. This procedure is described in detail in 
iJammalamadaka fc Senguptal ()2001[ ). Briefly, we first reparametrize the wrapped-Cauchy distribution (eqn. [T9|) by 
writing v = 2(/{l + ('^), which gives 

PiO) oc ^— L_ , ^,|.). (Bl) 

Following the standard recipe of maximizing the logarithm of the likelihood with respect to the desired parameter, we 
sum the logarithms of the probability of all angles and impose the condition that its derivative with respect to v is 
zero, resulting in the equation 



0, 



(B2) 



which can be solved iteratively. The concentration parameter is then easily determined by inverting the above relation 
to get ( — {1 — \J\ — v'^')lv. This procedure is repeated for each distinct population of phases, parametrized by 
transverse separation rj^ and /c-mode, 9(r±,k), and for each model in the thermal parameter grid (To,7, Aj) that we 
consider. 



